Novel Serine Protease Genes Related to DPPIV

Field of the Invention 

The present invention relates to novel serine proteases related to dipeptidyl
peptidase IV (DPPIV), and to isolated nucleic acids coding for these proteases, all of
which are useful for the discovery of new therapeutic agents, for measuring protease
activity, and for determining the inhibitory activity of compounds against these
proteases.

Background of the Invention 

Proteases and peptidases are enzymes that catalyse the hydrolysis of peptidic
amide bonds. Proteases play an important role in the regulation of biological processes
in almost every life-form from bacteria to viruses to mammals. They perform critical
functions in, for example, digestion, blood clotting, apoptosis, activation of immune
responses, zymogen activation, viral maturation, protein secretion and protein
trafficking. They can be classified according to a number of criteria, such as site of
action, substrate preference, and mechanism. So, for example, aminopeptidases act
preferentially at the N-terminal residues of a peptide, while carboxypeptidases act
preferentially at the C-terminus and endopeptidases act at sites removed from the two
termini. Among the carboxy- and aminopeptidases, peptidyl peptidases cleave a single
amino acid residue from the substrate, dipeptidyl peptidases cleave a dipeptide unit (two
amino acids) from the substrate, and tripeptidases cleave three amino acids from the
substrate. Substrate preference is frequently expressed in terms of the amino acid
residue immediately N-terminal to the cleavage site. For example, trypsin-like
peptidases will preferentially cleave a peptide next to a basic amino acid (arginine or
lysine), i.e. where the bond hydrolysed is the Arg/Lys-Xaa bond. As another example,
the chymotrypsin-like family of peptidases preferentially hydrolyse peptides adjacent to
an aromatic residue. Mechanistically, peptidases are classified as being
serine-dependent, cysteine-dependent, aspartic acid-dependent or zinc-dependent.

Because peptidases and proteases are involved in the regulation of many
physiological processes, they are attractive targets for the development of therapeutic
agents. Protease and peptidase inhibitors are, for example, used in the treatment of
hypertension, coagulation disorders, and viral infection.

Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous,
being found in viruses, bacteria and eukaryotes. Over 20 families (denoted S1 - S27) of
serine protease have been identified; these are grouped into 6 clans (SA, SB, SC, SE, SF
and SG) on the basis of structural similarity and other functional evidence. Structures
are known for four of the clans (SA, SB, SC and SE); these appear to be totally
unrelated, suggesting at least four evolutionary origins of serine peptidases and possibly
many more (Rawlings and Barrett, Meth. Enzymol., 244: 19-61 (1994)).

The prolyl oligopeptidase family consists of a number of evolutionarily related
peptidases whose catalytic activity seems to be provided by a charge relay system similar
to that of the trypsin family of serine proteases, but which evolved independently by
convergent evolution. A conserved serine residue has been shown experimentally (in E.
coli protease II as well as in pig and bacterial PE) to be necessary for the catalytic
mechanism. This serine, which is part of the catalytic triad (Ser, His, Asp), is generally
located about 150 residues away from the C-terminal extremity of these enzymes (which
are all proteins that contain about 700 to 800 amino acids).

One of the most intensively studied prolyl oligopeptidases is dipeptidyl peptidase
IV (DPPIV, EC 3.4.14.5), a type II glycoprotein, which is the only well characterised
dipeptidyl aminopeptidase known to be located on the outer side of plasma membranes.
As indicated above, dipeptidyl aminopeptidases are characterised by their ability to cleave
N-terminal dipeptides from a variety of small peptides. Dipeptidyl aminopeptidases show
different substrate specificities and cellular localisation, suggesting different functions of
each activity in peptide processing. DPPIV is characterised by its capacity to cleave
N-terminal dipeptides containing proline or alanine as the penultimate residue. The DPPIV
gene spans approximately 70 kb and contains 26 exons, ranging in size from 45 bp to 1.4
kb. The nucleotide sequence (3,465 bp) of the cDNA contains an open reading frame
encoding a polypeptide comprising 766 amino acids. The nucleotides that encode the
active site sequence (G-W-S-Y-G) are split between 2 exons. This clearly distinguishes
the genomic organisation of the prolyl oligopeptidase family from that of the classic serine
protease family.

DPPIV is widely distributed in mammalian tissues and is found in great
abundance in the kidney, intestinal epithelium and placenta (Yaron, A. and Naider, F.,
Critical Reviews in Biochem. Mol. Biol. 1993 [1], 31). In the human immune system,
the enzyme is expressed almost exclusively by activated T-lymphocytes of the CD4+
type where the enzyme has been shown to be synonymous with the cell-surface antigen
CD26. Although the exact role of DPPIV in human physiology is still not completely
understood, recent research has shown that the enzyme clearly has a major role in human
physiology and pathophysiology.

On human T cells, DPPIV expression appears late in thymic differentiation and is
preferentially restricted to the CD4+ helper/memory population, and CD26 can deliver a
potent co-stimulatory T-cell activation signal. DPPIV, also known as T-cell activation
antigen CD26, therefore plays an important role in the immune response via association
with CD45 tyrosine phosphatase and, through its ability to bind adenosine deaminase
(ADA) to the T-cell surface, protects the T-cell from adenosine-mediated inhibition of
proliferation. Furthermore, the regulation of the function of chemokines by
CD26/DPPIV appears to be essential for lymphocyte trafficking and infectivity of HIV
strains. DPPIV has been associated with numerous functions including involvement in
T-cell activation, cell adhesion, digestion of proline-containing peptides in the kidney
and intestines, HIV infection and apoptosis, and regulation of tumorigenicity in certain
melanoma cells (Pethiyagoda et al., [citation illegible]). DPPIV
is also implicated in endocrine regulation and metabolic physiology. More
particularly, DPPIV cleaves the amino-terminal His-Ala dipeptide of GLP-1, generating
a GLP-1 receptor antagonist, and thereby shortens the physiological response to GLP-1.
Glucagon-like peptide-1 (GLP-1), an incretin that induces glucose-dependent insulin
secretion, is rapidly degraded by DPPIV, and since the half-life for DPPIV cleavage is
much shorter than the half-life for removal of GLP-1 from circulation, a significant
increase in GLP-1 bioactivity (5- to 10-fold) is anticipated from DPPIV inhibition.
Inhibitors of DPPIV are currently being studied in the clinic as potential therapeutic
agents for type 2 diabetes and impaired glucose tolerance.
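The substrate rule described above (removal of an N-terminal dipeptide when the penultimate residue is proline or alanine) can be illustrated with a minimal Python sketch. The function name, the one-letter peptide strings and the minimum-length check are assumptions made for illustration only; the six-residue string is used as an illustrative N-terminus for GLP-1, of which only the His-Ala dipeptide is stated in the text. The sketch ignores kinetics and any other determinant of substrate recognition.

```python
def cleave_n_terminal_dipeptide(peptide, allowed_penultimate="PA"):
    """Apply a simplified DPPIV-like rule to a one-letter peptide string.

    Returns (dipeptide, remainder) when the second residue (the penultimate
    residue counted from the N-terminus) is Pro or Ala, otherwise None.
    Hypothetical helper for illustration; not taken from the specification.
    """
    # Assumption: at least one residue must remain after the dipeptide is removed.
    if len(peptide) < 3:
        return None
    if peptide[1] not in allowed_penultimate:
        return None
    return peptide[:2], peptide[2:]


# Illustrative N-terminus beginning with His-Ala, as described for GLP-1.
print(cleave_n_terminal_dipeptide("HAEGTF"))
# A peptide whose second residue is neither Pro nor Ala is left untouched.
print(cleave_n_terminal_dipeptide("GLSKGC"))
```

In this simplified model the truncated remainder corresponds to the shortened peptide that the text describes as a receptor antagonist.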

Various different inhibitors of DPPIV were known in 1993. One of these is a
suicide inhibitor, N-Ala-Pro-O-(nitrobenzoyl-)hydroxylamine. Another is a competitive
inhibitor, ε-(4-nitro)benzoxycarbonyl-Lys-Pro, and another is a polyclonal rabbit
anti-porcine kidney DPPIV immunoglobulin. Others have since been developed and are
described in detail in U.S. Patents Nos. 5,939,560, 6,110,949, 6,011,155 and
5,462,[illegible].

In addition to, but independent of, its serine-type catalytic activity, DPPIV binds
closely to the soluble extracellular enzyme adenosine deaminase (ADA), acting as a
receptor, and is thought to mediate signal transduction. DPPIV structure is characterized
by two extracellular domains, an α/β fold hydrolase domain and a 7-blade beta-propeller
domain consisting of repeated beta sheets of about 50 amino acids. Recently it has been
shown that, besides selecting substrates by size, the beta-propeller domain, containing 10
of the 12 highly conserved cysteine residues, contributes to catalysis of the peptidase
domain. In addition, the cysteine-rich domain is responsible for DPPIV binding to
collagen I and to extracellular ADA. DPPIV is also reported to play a role in
fibronectin-mediated interactions of cells with extracellular matrix. Recent studies show
that the protease activity of DPPIV is not required for its anti-invasive activity because
mutants of DPPIV that lack the extracellular serine protease activity maintain such activity.

A number of proteins that share similarities with DPPIV have been reported in
the literature. Several of these proteins have been cloned including DPP-I, DPP-II,
DPP-III, DPP-X and fibroblast activation protein (FAP). These have been identified and
characterised either by molecular cloning and functional studies of expressed proteins or
as biochemical activities in tissue extracts. DPPIV-beta and other novel peptidases with
functional similarities to DPPIV are not yet cloned. The identification, characterization
and/or appropriate classification of further members of the family of prolyl
oligopeptidases, the elucidation of their physiological (and particularly
pathophysiological) role, and the application of that knowledge to the development of
new therapeutic agents are significant challenges.

Summary of the Invention 

The present invention provides proteins with prolyl oligopeptidase (post-proline
cleaving) activities that constitute three novel members of a family of proteins related to
DPPIV, including the full-length proteins, alternative splice forms, subunits, and
mutants, as well as nucleotide sequences encoding the same. The present invention also
provides methods of screening for substrates, interacting proteins, agonists, antagonists
or inhibitors of the above proteins, and furthermore provides pharmaceutical compositions
comprising the proteins and/or mutants, derivatives and/or analogues thereof and/or
ligands thereto.

These novel proteins having significant sequence homology to DPPIV are termed
dipeptidyl peptidase IV-related protein-1, 2 & 3 (DPRP-1, DPRP-2 and DPRP-3). The
amino acid sequences of DPRP-1, DPRP-2 and DPRP-3 are given in SEQ ID NOS:1, 3
and 5 respectively. Further disclosed are nucleic acid sequences coding for these
proteins (SEQ ID NOS:2, 4 and 6). Table 1 illustrates the homology (i.e. similarity)
between the novel proteins DPRP-1, DPRP-2 and DPRP-3 and other known serine
proteases.

Table 1 - Comparison of the sequences of these three novel proteins with DPPIV
and other Clan SC, Family S9 members and Subfamily B members

The greatest homology between DPRP-1, DPRP-2 and DPPIV is seen in the
C-terminal sequences. On the basis of sequence homology with DPPIV (see Figure 1), one
might predict that these DPRP proteins would have functions that include, but are not
limited to, roles as enzymes. Cloning, expression, biochemical and molecular
characterization have confirmed this hypothesis.

The expression pattern of DPRPs and the localization to specialized epithelial
cells and plasma cells (Leydig cells, prostate epithelial cells, lymphocytes, B cells) is
consistent with a role in differentiation, proliferation and inflammation. The localization
of the DPRP-1 gene in hormone-sensitive cancers (breast, prostate, testicular) and tissues
regulated by testosterone, and the abundant expression in poorly differentiated cancers,
demonstrate that DPRP-activating or inhibiting molecules will have numerous
therapeutic applications in the treatment of disorders characterized by dysregulated
growth, differentiation and steroid or polypeptide hormone synthesis and degradation.
Data disclosed herein support the hypothesis that DPRP-1 and DPRP-2 are involved in
the regulation of proliferation of in vitro models of prostate and testis cancer well known
to those skilled in the art.

DPRP-1 and DPRP-2 activities described herein and their expression patterns are
compatible with their having functional roles as physiological regulators of the immune
and neuroendocrine systems through the enzymatic modification of biochemical
mediators like peptides and chemokines. The numerous functions previously described
for DPPIV based upon the use of inhibitors may be due in part to its action and that of
similar proteins, like the DPRPs. Therefore, the discovery of selective and potent
inhibitors of DPPIV, of the DPRPs and of other related proteases like FAP is considered
central to achieving effective and safe pharmaceutical use of these and any newly
identified serine protease inhibitors, as well as other active compounds that modify the
function(s) of such proteins.

The invention thus provides novel proteins or polypeptides, the nucleic acids
coding therefor, cells which have been modified with the nucleic acid so as to express
these proteins, antibodies to these proteins, a screening method for the discovery of new
therapeutic agents which are inhibitors of the activity of these proteins (or which are
inhibitors of DPPIV and not of the proteins), and therapeutic agents discovered by such
screening methods. The novel proteins and the nucleic acids coding therefor can be used
to discover new therapeutic agents for the treatment of certain diseases, such as, for
example, reproductive, inflammatory and metabolic disorders, and also in the preparation
of antibodies with therapeutic or diagnostic value.

In accordance with one aspect of the present invention, there are provided novel,
mature, biologically active proteins, principally of human origin. Such proteins may be
isolated in small quantities from suitable animal (including human) tissue or biological
fluids by standard techniques; however, larger quantities are more conveniently prepared
in cultures of cells genetically modified so as to express the protein.

In accordance with another aspect of the present invention, there are provided 
isolated nucleic acid molecules encoding polypeptides of the present invention including 
mRNAs, DNAs, cDNAs and genomic DNAs thereof.

In accordance with a further aspect of the present invention, nucleic acid probes
are also provided comprising nucleic acid molecules of sufficient length to specifically
hybridize to a nucleic acid sequence of the present invention.

In accordance with a still further aspect of the present invention, processes
utilizing recombinant techniques are provided for producing such polypeptides useful for
in vitro scientific research, for example, synthesis of DNA and manufacture of DNA
vectors. Processes for producing such polypeptides include culturing recombinant
prokaryotic and/or eukaryotic host cells that have been transfected with DNA vectors
containing a nucleic acid sequence encoding such a polypeptide and/or the mature
protein under conditions promoting expression of such protein and subsequent recovery
of such protein or a fragment of the expressed product.

In accordance with still another aspect, the invention provides methods for using
DPRP polypeptides and polynucleotides, including the treatment of infections, such as
bacterial, fungal, protozoan and viral infections, particularly infections caused by HIV-1
or HIV-2, pain, diabetes, precocious puberty, infertility, obesity, anorexia, bulimia,
Parkinson's disease, acute heart failure, hypotension, hypertension, urinary retention,
osteoporosis, angina pectoris, myocardial infarction, stroke, ulcers, asthma, allergies,
benign prostatic hypertrophy, cancers including hormone-sensitive and
androgen-independent cancers, migraines, vomiting, psychotic and neurological disorders,
including anxiety, schizophrenia, manic depression, depression, dementia, and severe
mental retardation, and dyskinesias, hereinafter collectively referred to as "the Diseases".

In accordance with yet another aspect of the present invention, there is provided a
process for utilizing such polypeptides, or polynucleotides encoding such polypeptides,
for the discovery of compounds that inhibit the biological activity of the mature proteins
thereof, e.g. by cleaving an N-terminal dipeptide, and such inhibitors are thus also
provided.

In accordance with a more specific aspect, the invention provides an isolated nucleic
acid which encodes (a) a polypeptide which includes the amino acid sequence of one of
SEQ ID NOS:1, 3 and 5, or (b) a polypeptide having an amino acid sequence that is at
least about 70% similar thereto and exhibits the same biological function, or which is an
alternative splice variant of one of SEQ ID NOS:2, 4 and 6, or which is a probe
comprising at least 14 contiguous nucleotides from said nucleic acid encoding (a) or (b),
or which is complementary to any one of the foregoing.

In accordance with another specific aspect, the invention provides a polypeptide
which may be optionally glycosylated, and which (a) has the amino acid sequence of a
mature protein set forth in any one of SEQ ID NOS:1, 3 and 5; (b) has the amino acid
sequence of a mature protein having at least about 70% similarity to one of the mature
proteins of (a) and which exhibits the same biological function; (c) has the amino acid
sequence of a mature protein having at least about 90% identity with a mature protein of
any of SEQ ID NOS:1, 3 and 5; or (d) is an immunologically reactive fragment of (a).

In accordance with still another specific aspect, the invention provides a method
for the screening for a compound capable of inhibiting the enzymatic activity of at least
one mature protein of the invention, which method comprises incubating said mature
protein and a suitable substrate for said mature protein in the presence of one or more
test compounds or salts thereof, measuring the enzymatic activity of said mature protein,
comparing said activity with comparable activity determined in the absence of a test
compound, and selecting the test compound or compounds that reduce the enzymatic
activity. It also provides a method for screening for a compound capable of
inhibiting the enzymatic activity of DPPIV that does not inhibit the enzymatic activity of
at least one mature protein of the invention, which method comprises incubating said
mature protein and a suitable substrate in the presence of one or more
inhibitors of DPPIV or salts thereof, measuring the enzymatic activity of said mature
protein, comparing said activity with comparable activity determined in the absence of
the DPPIV inhibitor, and selecting a compound that does not reduce the enzymatic
activity of said mature protein.
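The comparison step in both screening methods amounts to expressing the activity measured with a compound as a fraction of the activity measured without it. The following Python sketch shows one way this could be computed. The function names, the 50% and 10% cut-offs, the compound labels and all activity values are hypothetical and chosen only for illustration; the specification does not state numerical thresholds.

```python
def percent_inhibition(activity_with_compound, activity_control):
    """Percent reduction in enzymatic activity relative to the no-compound control."""
    if activity_control <= 0:
        raise ValueError("control activity must be positive")
    return 100.0 * (1.0 - activity_with_compound / activity_control)


def select_inhibitors(activities, control, min_inhibition=50.0):
    """First method: keep test compounds that reduce the activity of the mature protein.

    The 50% threshold is an assumed cut-off, not a value from the text.
    """
    return [name for name, activity in activities.items()
            if percent_inhibition(activity, control) >= min_inhibition]


def select_dppiv_selective(activities, control, max_inhibition=10.0):
    """Second method: keep known DPPIV inhibitors that do NOT reduce the
    activity of the mature protein (within an assumed 10% tolerance)."""
    return [name for name, activity in activities.items()
            if percent_inhibition(activity, control) <= max_inhibition]


# Hypothetical activity readings (arbitrary units) for a DPRP assay.
control_activity = 100.0
test_compounds = {"cmpd_A": 20.0, "cmpd_B": 95.0}
dppiv_inhibitors = {"dppiv_inh_X": 97.0, "dppiv_inh_Y": 40.0}

print(select_inhibitors(test_compounds, control_activity))
print(select_dppiv_selective(dppiv_inhibitors, control_activity))
```

Running the same compound set against both DPPIV and a DPRP and comparing the two inhibition values would give a simple selectivity readout.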

These and other aspects of the present invention should be apparent to those
skilled in the art from the detailed description which follows.

Brief Description of the Drawings 
FIGS. 1A and 1B show the co-linear alignment of DPRP-1, DPRP-2, DPRP-3
and DPPIV, with shading being supplied to indicate the same (black) or similar (gray)
amino acid residues at a particular location.

FIG. 2 is similar to FIG. 1 and shows co-linear alignment of human and mouse
DPRP-2.

FIG. 3 is a graph which shows the effects of various tetrapeptide amide inhibitors
on dipeptidyl peptidase enzyme activity.

FIGS. 4A-4C show the effects of three inhibitor compounds on the proliferation
of PC3 prostate cancer cell lines at various doses.

Detailed Description of the Preferred Embodiments 
In accordance with an aspect of the present invention, there are provided isolated
nucleic acid sequences (polynucleotides), which encode the mature polypeptides having
the deduced amino acid sequences of the three DPRPs (SEQ ID NOS:1, 3 and 5).

The polynucleotides of this invention were discovered using a human testis
cDNA library (DPRP-1), a human colon library (DPRP-2) and a human hypothalamus
cDNA library (DPRP-3). Isolated nucleic acid for DPRP-1 contains an open reading
frame encoding a protein of approximately 882 amino acids in length which is
structurally related to human DPPIV, showing 26% identity and 41% similarity over the
entire human DPPIV protein sequence. Isolated nucleic acid for DPRP-2 contains an
open reading frame encoding a protein of approximately 864 amino acids, which is
39% similar to the entire DPPIV amino acid sequence. Analysis of the DPRP-1 and
DPRP-2 primary amino acid sequences using hydrophobicity plots predicts that these two
proteins do not have a transmembrane domain. Despite this fact, it is possible that these
intracellular serine proteases are secreted upon cellular activation. Quiescent cell proline
dipeptidase (QPP) is a serine protease that is targeted to intracellular vesicles that are
distinct from lysosomes (Chiravuri M, et al., J. Immunol. 2000 Nov 15;165(10):5695-702).
This hypothesis expands the potential site(s) and scope of DPRP-1 and DPRP-2
involvement in mechanisms for post-translational regulation of chemokines, cytokines,
peptides and polypeptides. The full length DPRP-3 sequence contains 796 amino acids,
a signal peptide from 1 to 48, and a transmembrane domain between 34 and 56. The
mature protein is predicted to be a type II membrane protein and may be cleaved to
produce a soluble form. The amino acid sequence is set forth in SEQ ID NO:5, which
was deduced from SEQ ID NO:6 and has 54% similarity with DPPIV.

Amino acid sequence alignments of these polypeptides with members of the
prolyl oligopeptidase enzyme subfamily S9B show that all three DPRP proteins have
overall sequence and structural homology to DPPIV and FAP. DPRPs are predicted to
be members of the enzyme Clan SC (Serine nucleophile) with catalytic residues in the
order Ser, Asp, His and the active site sequence (G-W-S-Y-G).



Table 2. Homology (i.e. similarity) between DPRP-1, DPRP-2, DPRP-3 and members
of the prolyl oligopeptidase family S9B enzymes.

DPRP-1, DPRP-2 and DPRP-3 do not exhibit sequence similarity with any
members of the classical serine protease families, chymotrypsin and subtilisin. The order
of the catalytic triad residues differs among the three main groups: His-Asp-Ser in
chymotrypsin (clan SA), Asp-His-Ser in subtilisin (clan SB) and Ser-Asp-His in the prolyl
oligopeptidases (clan SC).

As shown in Table 2, DPRP-3 has the highest homology with DPPVI (68%
homology and 51% identity). Wada et al. isolated cDNA clones for DPPVI, a
DPPIV-related protein, from bovine, rat (Wada et al., Proc. Nat. Acad. Sci., 89: 197-201 (1992))
and human (Yokotani et al., Hum. Molec. Genet., 1037-1039 (1993)) brain libraries.
They demonstrated that, unlike DPPIV, the catalytic triad in DPPVI does not have the
first serine residue. In DPRP-3 two of the amino acids in the catalytic triad
characteristic of the serine protease family are conserved. However, the serine residue
itself is replaced by glycine. While the absence of the serine residue is likely to prevent
protease activity at this site, it is possible that multiple other functions mediated by other
functional domains of the protein remain intact.

As briefly described above, DPPIV is a multifunctional molecule that exerts
important functions depending on the expressed cells and tissues, in addition to its
catalytic activity as a peptidase. DPRP-3 and DPPVI are also likely to maintain multiple
functions despite the absence of an intact catalytic triad. For example, DPPVI has been
implicated in the regulation of neuronal plasticity. DPPVI is highly expressed in the
hippocampus, thalamus, hypothalamus and striatum. In addition, developmental arrest
and embryonic lethality of rump white Rw/Rw embryos is thought to be due to disruption
of the DPPVI gene. The Rw mutation is associated with a chromosomal inversion spanning
30 cM of the proximal portion of mouse chromosome 5. Genomic analysis of the DPPVI
gene on the Rw chromosome places the inversion breakpoint in the coding region,
resulting in loss of a significant fraction of the C-terminal region (Hough R.B. et al.,
Proc. Nat. Acad. Sci., 95, 13800-13805 (1998)).

The human DPRP-1 gene, predicted to be 32668 bp in length, has at least 22
exons and eight transcripts. It maps to chromosome 15 (NT_010265) at position
15q21.1 - 15q22.1. The lengths of predicted alternative splice variant transcripts vary
between 602 bp and 4523 bp (see SEQ ID NOS:7-22). This is in agreement with the
multiple transcripts observed by Northern blot analysis (see Example 2). ESTs
representing the transcripts were found in numerous tissues including senescent
fibroblasts, T-lymphocytes, germinal center B-cells, germ cell seminoma, testis,
melanocytes, uterus, ovary, breast, multiple sclerosis lesions, pancreas and placenta.

Human DPRP-2 belongs to a gene with at least 27 exons and nine splice variants
(see SEQ ID NOS:23-40). One SNP was observed in the 3' UTR (88% (37) C vs. 12%
(5) T). The DPRP-2 gene maps to region 19p13.3 of chromosome 19. This location is
host to a number of disease markers and is associated with various disorders including
hypocalciuric hypercalcemia, type II cerebellar ataxia, muscular dystrophy, convulsions,
susceptibility to atherosclerosis, psoriasis, ectodermal dysplasia, and acute myeloid
leukemia. In agreement with the ubiquitous distribution of the mRNA observed by
Northern blot analysis (see Example 2), DPRP-2 was expressed in a wide variety of
tissues upon examination of EST coverage (e.g. over 64 ESTs expressed in liver,
spleen, muscle, melanocytes, heart, lung, placenta, skin, pancreas, stomach, brain,
parathyroid gland).



WO 02/31134 PCT/US01/31874

Human DPRP-3 belongs to a gene with at least 23 exons and two splice variants
(see SEQ ID NOS:41-44). The gene maps to chromosome 2 (NT_005445) at position
2q12.3-2q14.1. Transcripts for DPRP-3 did not show as wide a distribution as DPRP-1
and DPRP-2. As shown by Northern blot in Example 2, DPRP-3 expression is restricted
to brain and pancreas. ESTs representing the DPRP-3 mRNA were abundant in tissue
derived from multiple sclerosis lesions, hypothalamus, whole brain and nerves, with a
few transcripts being found in uterus and colon.

The relationships among human and rodent proteases in clan SC, including
DPRP-1, DPRP-2 and DPRP-3, were analyzed using the Neighbor Joining (NJ) method (see
Saitou and Nei, Mol. Biol. Evol., 4: 406-425 (1987)). Phylogenetic analysis shows that
among the S9 proteases, DPRP-1 and DPRP-2, both lacking a transmembrane domain,
are distinguished from DPPIV and its closely related proteins like FAP. Similarity is
shown, however, between DPPIV and FAP and between DPRP-3 and DPPVI, which are
all type II membrane proteins.

A database search for additional DPRP-related genes revealed the presence of a
murine sequence related to DPRP-1. Alignment of this mouse sequence with the novel
human proteases shows that the mDPRP-1 displays considerable homology with its
human counterpart (FIG. 2). One skilled in the art will readily recognize that the novel
mouse protease gene can be isolated using the sequence information disclosed herein and
can be readily incorporated into one of the routinely used expression constructs which
are well known in the art. Use of this disclosed sequence by those skilled in the art to
generate a transgenic mouse model will employ development of gene-targeting vectors,
for example, that result in homologous recombination in mouse embryonic stem cells.
The use of knockout mice in further analysis of the function of DPRP genes is a valuable
tool.

The polynucleotides of the present invention may be in the form of RNA or in
the form of DNA; DNA should be understood to include cDNA, genomic DNA, and
synthetic DNA. The DNA may be double-stranded or single-stranded and, if
single-stranded, may be the coding strand or non-coding (antisense) strand. The coding
sequence which encodes the mature polypeptide may be identical to the coding sequence
shown in SEQ ID NOS:2, 4 and 6 respectively, or it may be a different coding sequence
encoding the same mature polypeptide, as a result of the redundancy or degeneracy of
the genetic code or a single nucleotide polymorphism. For example, it may also be an
RNA transcript which includes the entire length of any one of SEQ ID NOS:2, 4 and 6.
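The point about degeneracy can be made concrete with a short Python sketch in which two different coding sequences translate to the same peptide. The codon table below is a deliberately partial subset of the standard genetic code, and both DNA strings are invented for illustration (they encode the G-W-S-Y-G motif mentioned in this document but are not taken from SEQ ID NOS:2, 4 or 6).

```python
# Partial codon table (subset of the standard genetic code), sufficient
# only for the residues used in this illustration.
CODON_TABLE = {
    "GGT": "G", "GGC": "G", "GGA": "G", "GGG": "G",
    "TGG": "W",
    "TCT": "S", "TCC": "S", "TCA": "S", "TCG": "S", "AGT": "S", "AGC": "S",
    "TAT": "Y", "TAC": "Y",
}


def translate(dna):
    """Translate a coding-strand DNA string, read in frame from position 0."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of three")
    # A KeyError here means the codon is outside the partial table above.
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Two hypothetical coding sequences that differ at several positions.
seq_a = "GGTTGGTCTTATGGT"
seq_b = "GGCTGGAGCTACGGG"

print(translate(seq_a), translate(seq_b), seq_a == seq_b)
```

Both strings yield the same five-residue peptide even though the nucleotide sequences are not identical, which is the sense in which a "different coding sequence" can encode the same mature polypeptide.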




The polynucleotides which encode the mature proteins of SEQ ID NOS:1, 3, 5,
respectively, may include but are not limited to the coding sequence for the mature
protein alone; the coding sequence for the mature polypeptide plus additional coding
sequence, such as a leader or secretory sequence or a proprotein sequence; and the
coding sequence for the mature protein (and optionally additional coding sequence) plus
non-coding sequence, such as introns or a non-coding sequence 5' and/or 3' of the coding
sequence for the mature protein.

Thus, the term "polynucleotide encoding a polypeptide" or the term "nucleic acid
encoding a polypeptide" should be understood to encompass a polynucleotide or nucleic
acid which includes only coding sequence for the mature protein as well as one which
includes additional coding and/or non-coding sequence. The terms polynucleotides and
nucleic acid are used interchangeably.

The present invention also includes polynucleotides where the coding sequence
for the mature protein may be fused in the same reading frame to a polynucleotide
sequence which aids in expression and secretion of a polypeptide from a host cell; for
example a leader sequence which functions as a secretory sequence for controlling
transport of a polypeptide from the cell may be so fused. The polypeptide having such a
leader sequence is termed a preprotein or a preproprotein and may have the leader
sequence cleaved by the host cell to form the mature form of the protein. These
polynucleotides may have a 5' extended region so that they encode a proprotein, which is
the mature protein plus additional amino acid residues at the N-terminus. The
expression product having such a prosequence is termed a proprotein, which is an
inactive form of the mature protein; however, once the prosequence is cleaved an active
mature protein remains. Thus, for example, the polynucleotides of the present invention
may encode mature proteins, or proteins having a prosequence, or proteins having both a
prosequence and a presequence (leader sequence).

The polynucleotides of the present invention may also have the coding sequence
fused in frame to a marker sequence which allows for purification of the polypeptides of
the present invention. The marker sequence may be a polyhistidine tag, a hemagglutinin
(HA) tag, a c-myc tag or a V5 tag when a mammalian host, e.g. COS-1 cells, is used.
The HA tag would correspond to an epitope derived from the influenza hemagglutinin
protein (Wilson, I., et al., Cell, 37:767 (1984)), and the c-myc tag may be an epitope
from human Myc protein (Evans, G.I. et al., Mol. Cell. Biol., 5: 3610-3616 (1985)).

The term "gene" means the segment of DNA involved in producing a polypeptide 
chain; it includes regions preceding and following the coding region (leader and trailer)
as well as intervening sequences (introns) between individual coding segments (exons).
The term "significant sequence homology" is intended to denote that at least 25%, 
preferably at least 40%, of the amino acid residues are conserved, and that, of the non- 
conserved residues, at least 40% are conservative substitutions. 
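
The homology criterion defined above can be illustrated with a minimal sketch. It assumes two amino acid sequences that have already been aligned to equal length (gaps written as "-"), and it reads "conserved" as "identical at the aligned position". The grouping of residues into conservative-substitution classes is a hypothetical choice made for illustration only; the specification does not prescribe one.

```python
# Hypothetical conservative-substitution classes (an assumption for illustration;
# the specification does not define which substitutions are conservative).
CONSERVATIVE_GROUPS = [set("ILVM"), set("FYW"), set("KRH"),
                       set("DE"), set("NQ"), set("ST"), set("AG")]


def is_conservative(a, b):
    """Return True if residues a and b fall in the same assumed class."""
    return any(a in group and b in group for group in CONSERVATIVE_GROUPS)


def homology_stats(seq1, seq2):
    """Return (% identical residues, % of non-identical residues that are conservative)."""
    if len(seq1) != len(seq2):
        raise ValueError("sequences must be pre-aligned to equal length")
    # Ignore alignment columns that contain a gap in either sequence.
    pairs = [(a, b) for a, b in zip(seq1.upper(), seq2.upper())
             if a != "-" and b != "-"]
    if not pairs:
        raise ValueError("no aligned residues to compare")
    identical = sum(1 for a, b in pairs if a == b)
    mismatched = [(a, b) for a, b in pairs if a != b]
    conservative = sum(1 for a, b in mismatched if is_conservative(a, b))
    pct_identical = 100.0 * identical / len(pairs)
    # With no mismatches the second criterion is trivially satisfied.
    pct_conservative = 100.0 * conservative / len(mismatched) if mismatched else 100.0
    return pct_identical, pct_conservative


def has_significant_homology(seq1, seq2, min_conserved=25.0, min_conservative=40.0):
    """Apply the 25% conserved / 40% conservative-substitution thresholds."""
    pct_identical, pct_conservative = homology_stats(seq1, seq2)
    return pct_identical >= min_conserved and pct_conservative >= min_conservative
```

Under these assumptions, the aligned fragments "AKDE" and "ARDQ" share two of four residues, and one of the two differences (K to R) falls within an assumed class, so both thresholds are met.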

Fragments of the full-length genes of the present invention may be used as a
hybridization probe for a cDNA library to isolate full-length cDNA as well as to isolate 
other cDNAs which have significant sequence homology to the gene and will encode 
proteins or polypeptides having similar biological activity or function. By similar 
biological activity or function, for purposes of this application, is meant the ability to 
cleave an N-terminal dipeptide having Ala or Pro as the penultimate residue or other
amino acids. A probe of this type has at least 14 bases (at least 14 contiguous
nucleotides from one of SEQ ID NOS:2, 4 or 6), preferably at least 30 bases, and
may contain, for example, 50 or more bases. Such a probe may also be used to identify a
cDNA clone corresponding to a full-length transcript and/or a genomic clone or clones
that contain the complete gene, including regulatory and promoter regions, exons, and
introns. Labelled oligonucleotides having a sequence complementary to that of the gene
of the present invention are useful to screen a library of human cDNA, genomic DNA or
mRNA to locate members of the library to which the probe hybridizes. As an example, a
known DNA sequence may be used to synthesize an oligonucleotide probe which is then
used in screening a library to isolate the coding region of a gene of interest.

The present invention is considered to further provide polynucleotides which 
hybridize to the hereinabove-described sequences wherein there is at least 70%, 
preferably at least 90%, and more preferably at least 95% identity or similarity between 
the sequences, and thus encode proteins having similar biological activity. Moreover, as 
known in the art, there is "similarity" between two polypeptides when the amino acid
sequences contain the same or conserved amino acid substitutes for each individual 
residue in the sequence. Identity and similarity may be measured using sequence 
analysis software (e.g., Sequence Analysis Software Package of the Genetics Computer 
Group, University of Wisconsin Biotechnology Center, 1710 University Avenue, 
Madison, WI 53705). The present invention particularly provides such polynucleotides
which hybridize under stringent conditions to the hereinabove-described 
polynucleotides. As herein used, the term "stringent conditions" means conditions which 
permit hybridization between polynucleotide sequences and the polynucleotide
sequences of SEQ ID NOS:2, 4 and 6 where there is at least about 70% identity.
Suitably stringent conditions can be defined by, e.g., the concentrations of salt or
formamide in the prehybridization and hybridization solutions, or by the hybridization 
temperature, and are well known in the art. In particular, stringency can be increased by 
reducing the concentration of salt, by increasing the concentration of formamide, and/or 
by raising the hybridization temperature. 
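
The direction of these effects can be illustrated with a minimal sketch. It relies on a commonly cited empirical approximation for the melting temperature (Tm) of long DNA duplexes, which is an assumption brought in for illustration and is not part of this specification; the coefficients are approximate and the function names are hypothetical. A smaller gap between Tm and the hybridization temperature corresponds to higher stringency.

```python
import math


def estimated_tm(gc_percent, length, na_molar, formamide_percent):
    """Approximate duplex melting temperature in degrees C.

    Assumed empirical formula for long DNA:DNA hybrids:
        Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 0.61*(%formamide) - 500/length
    The coefficients are approximations, not values taken from the specification.
    """
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / length)


def stringency_margin(tm, hybridization_temp):
    """Degrees below Tm at which hybridization is run; smaller means more stringent."""
    return tm - hybridization_temp


if __name__ == "__main__":
    # Hypothetical probe: 500 bases, 50% GC, hybridized at 42 degrees C.
    for salt, formamide in [(1.0, 35.0), (1.0, 50.0), (0.1, 50.0)]:
        tm = estimated_tm(50.0, 500, salt, formamide)
        print(f"[Na+]={salt} M, formamide={formamide}%: "
              f"Tm={tm:.1f} C, margin={stringency_margin(tm, 42.0):.1f} C")
```

In this sketch, lowering the salt concentration or raising the formamide concentration lowers the estimated Tm, and raising the hybridization temperature narrows the margin directly, which matches the three ways of increasing stringency stated above.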

For example, hybridization under high stringency conditions may employ about
50% formamide at about 37°C to 42°C, whereas hybridization under reduced stringency 
conditions might employ about 35% to 25% formamide at about 30°C to 35°C. One 
particular set of conditions for hybridization under high stringency conditions employs 
42°C, 50% formamide, 5x SSPE, 0.3% SDS, and 200 ng/ml sheared and denatured
salmon sperm DNA. For hybridization under reduced stringency, similar conditions as
described above may be used in 35% formamide at a reduced temperature of 35°C. The 
temperature range corresponding to a particular level of stringency can be further 
narrowed by calculating the purine to pyrimidine ratio of the nucleic acid of interest and 
adjusting the temperature accordingly. Variations on the above ranges and conditions 
are well known in the art. Preferably, hybridization should occur only if there is at least
95%, and more preferably at least 97%, identity between the sequences. The 
polynucleotides which hybridize to the hereinabove described polynucleotides in a 
preferred embodiment encode polypeptides which exhibit substantially the same 
biological function or activity as the mature protein encoded by one of the cDNAs of
SEQ ID NOS:2, 4 and 6.

As mentioned, a suitable polynucleotide probe may have at least 14 bases, 
preferably 30 bases, and more preferably at least 50 bases, and will hybridize to a 
polynucleotide of the present invention which has an identity thereto, as hereinabove 
described, and which may or may not retain activity. For example, such polynucleotides 
may be employed as a probe for hybridizing to the polynucleotides of SEQ ID NOS:2, 4
and 6 respectively, for example, for recovery of such a polynucleotide, or as a diagnostic
probe, or as a PCR primer. Thus, the present invention includes polynucleotides having
at least a 70% identity, preferably at least a 90% identity, and more preferably at least a
95% identity to a polynucleotide which encodes the polypeptides of SEQ ID NOS:1, 3
and 5 respectively, as well as fragments thereof, which fragments preferably have at
least 30 bases and more preferably at least 50 bases, and to polypeptides encoded by
such polynucleotides.

As is well known in the art, the genetic code is redundant in that certain amino 
acids are coded for by more than one nucleotide triplet (codon), and the invention 
includes those polynucleotide sequences which encode the same amino acids using a
different codon from that specifically exemplified in the sequences herein. Such a
polynucleotide sequence is referred to herein as an "equivalent" polynucleotide 
sequence. The present invention further includes variants of the hereinabove described 
polynucleotides which encode fragments, such as part or all of the mature protein,
analogs and derivatives of one of the polypeptides having the deduced amino acid
sequence of SEQ ID NOS:1, 3 and 5 respectively. The variant forms of the
polynucleotides may be a naturally occurring allelic variant of the polynucleotides or a 
non-naturally occurring variant of the polynucleotides. For example, the variant in the 
nucleic acid may simply be a difference in codon sequence for the amino acid resulting 
from the degeneracy of the genetic code, or there may be deletion variants, substitution 
variants and addition or insertion variants. As known in the art, an allelic variant is an
alternative form of a polynucleotide sequence which may have a substitution, deletion or
addition of one or more nucleotides that does not substantially alter the biological 
function of the encoded polypeptide. 
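
The notion of an "equivalent" polynucleotide sequence arising from the degeneracy of the genetic code can be illustrated with a minimal sketch. It uses the standard genetic code; the short sequences and the function names are hypothetical and are not taken from SEQ ID NOS:1 to 6.

```python
# Standard genetic code, built from the conventional TCAG ordering of bases.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA coding sequence in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def are_equivalent(dna1, dna2):
    """True if the sequences differ in nucleotides yet encode the same amino acids."""
    return dna1.upper() != dna2.upper() and translate(dna1) == translate(dna2)


if __name__ == "__main__":
    # Hypothetical example: two different codon choices for the dipeptide Ala-Pro.
    print(translate("GCTCCG"), translate("GCACCC"))
    print(are_equivalent("GCTCCG", "GCACCC"))
```

Here both hypothetical sequences encode Ala-Pro although they differ at the third position of each codon, which is the kind of difference in codon sequence referred to above.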

The present invention further includes polypeptides which have the deduced
amino acid sequence of SEQ ID NOS:1, 3 and 5, as well as fragments, analogs and
derivatives of such polypeptides. The terms "fragment," "derivative" and "analog",
when referring to the polypeptides of SEQ ID NOS:1, 3 and 5, mean polypeptides that
retain essentially the same biological function or activity as such polypeptides. An
analog might, for example, include a proprotein which can be activated by cleavage of 
the proprotein portion to produce an active mature protein. The polypeptides of the 
present invention may be recombinant polypeptides, natural polypeptides or synthetic 
polypeptides; however, they are preferably recombinant polypeptides, glycosylated or
unglycosylated. 

The fragment, derivative or analog of a polypeptide of SEQ ID NOS:1, 3 and 5
respectively, may be (i) one in which one or more of the amino acid residues is 
substituted with a conserved or non-conserved amino acid residue (preferably a 
conserved amino acid residue) and such substituted amino acid residue may or may not 
be one encoded by the genetic code, or (ii) one in which one or more of the amino acid 
residues includes a substituent group, or (iii) one in which additional amino acids are 
fused to the mature protein, such as a leader or secretory sequence or a sequence which
is employed for purification of the mature polypeptide or a proprotein sequence. Such
fragments, derivatives and analogs are deemed to be within the scope of those skilled in
the art to provide upon the basis of the teachings herein.

The polypeptides and polynucleotides of the present invention should be in an
isolated form, and preferably they are purified to substantial homogeneity or purity. By 
substantial homogeneity is meant a purity of at least about 85%. 

The term "isolated" is used to mean that the material has been removed from its 
original environment (e.g., the natural environment if it is naturally occurring). For
example, a naturally occurring polynucleotide or polypeptide present in a living animal 
is not considered to be isolated, but the same polynucleotide or polypeptide, when 
separated from substantially all of the coexisting materials in the natural system, is 
considered isolated. For DNA, the term includes, for example, a recombinant DNA
which is incorporated into a vector, into an autonomously replicating plasmid or virus, or
into the genomic DNA of a prokaryote or eukaryote; or which exists as a separate
molecule (e.g., a cDNA or a genomic or cDNA fragment produced by polymerase chain
reaction (PCR) or restriction endonuclease digestion) independent of other sequences. It
also includes a recombinant DNA which is part of a hybrid gene encoding additional
polypeptide sequence, e.g., a fusion protein. Further included is recombinant DNA
which includes a portion of the nucleotides shown in one of SEQ ID NOS:2, 4 or 6 which
encodes an alternative splice variant of the DPRP. Various alternative splice variants are
exemplified in SEQ ID NOS:8, 10, 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38,
40, 42, 44 and 46.

The polypeptides of the present invention include any one of the polypeptides of
SEQ ID NOS:1, 3 and 5 (in particular the mature protein), as well as polypeptides which
have at least 70% similarity (e.g. preferably at least 60% and more preferably at least
70% identity) to one of the polypeptides of SEQ ID NOS:1, 3 and 5, more preferably at
least 90% similarity (e.g. preferably at least 90% identity) to one of the polypeptides of
SEQ ID NOS:1, 3 and 5, and most preferably at least 95% similarity (e.g. preferably at
least 95% identity) to one of the polypeptides of SEQ ID NOS:1, 3 and 5. Moreover,
they should preferably include exact portions of such polypeptides containing a sequence 
of at least 30 amino acids, and more preferably at least 50 amino acids. 

Fragments or portions of the polypeptides of the present invention may be
employed as intermediates for producing the corresponding full-length polypeptides by
peptide synthesis. Fragments or portions of the polynucleotides of the present invention 
may also be used to synthesize full-length polynucleotides of the present invention. 

The present invention also includes vectors which include such polynucleotides, 
host cells which are genetically engineered with such vectors and the production of
polypeptides by recombinant techniques using the foregoing. Host cells are genetically
engineered (transduced or transformed or transfected) with such vectors which may be,
for example, a cloning vector or an expression vector. The vector may be, for example, 
in the form of a plasmid, a viral particle, a phage, etc. The engineered host cells can be 
cultured in conventional nutrient media modified as appropriate for activating promoters, 
selecting transformants or amplifying the genes of the present invention. The culture
conditions, such as temperature, pH and the like, are those commonly used with the host 
cell selected for expression, as well known to the ordinarily skilled artisan. 

The polynucleotides of the present invention may be employed for producing 
polypeptides by recombinant techniques. Thus, for example, the polynucleotides may be 
included in any one of a variety of expression vectors for expressing polypeptides. Such
vectors include chromosomal, nonchromosomal and synthetic DNA sequences, e.g., 
derivatives of SV40; bacterial plasmids; phage DNA; baculovirus; yeast plasmids; 
vectors derived from combinations of plasmids and phage DNA, viral DNA such as 
vaccinia, adenovirus, fowl pox virus, and pseudorabies. However, any other vector may 
be used as long as it is replicable and viable in the host.

The appropriate DNA sequence may be inserted into the vector by any of a 
variety of procedures. In general, the DNA sequence is inserted into an appropriate 
restriction endonuclease site(s) by procedures well known in the art, which procedures
are deemed to be within the scope of those skilled in this art.

The DNA sequence in the expression vector is operatively linked to an
appropriate expression control sequence(s) (promoter) to direct mRNA synthesis. As
representative examples of such promoters, there may be mentioned: LTR or SV40
promoter, the E. coli lac or trp, the phage lambda PL promoter and other promoters
known to control expression of genes in prokaryotic or eukaryotic cells or their viruses.
The expression vector should also contain a ribosome binding site for translation
initiation and a transcription terminator. The vector may also include appropriate 
sequences for amplifying expression. In addition, the expression vectors preferably 
contain one or more selectable marker genes to provide a phenotypic trait for selection 
of transformed host cells, such as dihydrofolate reductase or neomycin-resistance for 
eukaryotic cell culture, or such as tetracycline- or ampicillin-resistance in E. coli.

The vector containing the appropriate DNA sequence as hereinabove described, 
as well as an appropriate promoter or control sequence, may be employed to transform 
an appropriate host to permit the host to express the protein. As representative examples 
of appropriate hosts, there may be mentioned: bacterial cells, such as E. coli, 
Streptomyces, Salmonella typhimurium; fungal cells, such as yeast; insect cells, such as
Drosophila S2 and Spodoptera Sf9; animal cells, such as CHO, COS or Bowes
melanoma; adenoviruses; plant cells, etc. The selection of an appropriate host is deemed 
to be within the scope of those skilled in the art from the teachings herein. 

Synthetic production of nucleic acid sequences is well known in the art as is 
apparent from CLONTECH 95/96 Catalogue, pages 215-216, CLONTECH, 1020 East
Meadow Circle, Palo Alto, Calif. 94303. Thus, the present invention also includes
expression vectors useful for the production of the proteins of the present invention.

The present invention further includes recombinant constructs comprising one or 
more of the sequences as broadly described above. The constructs may comprise a
vector, such as a plasmid or viral vector, into which a sequence of the invention has been
inserted, in a forward or reverse orientation. In a preferred aspect of this embodiment, 
the construct further comprises regulatory sequences, including, for example, a 
promoter, operably linked to the sequence. Large numbers of suitable vectors and 
promoters are known to those of skill in the art, and are commercially available. The
following vectors are provided by way of example: Bacterial: pQE70, pQE60, pQE-9
(Qiagen), pBS, pD10, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16a,
pNH18A, pNH46A (Stratagene), ptrc99a, pKK223-3, pKK233-3, pDR540 and pRIT5
(Pharmacia); and Eukaryotic: pWLNEO, pSV2CAT, pOG44, pXT1, pSG (Stratagene),
pSVK3, pBPV, pMSG, and pSVL (Pharmacia). However, any other suitable plasmid or
vector may be used as long as it is replicable and viable in the host.

Promoter regions can be selected from any desired gene using CAT 
(chloramphenicol acetyl transferase) vectors or other vectors with selectable markers. 
Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial 
promoters include lacI, lacZ, T3, T7, gpt, lambda PR, PL and trp. Eukaryotic
promoters include CMV immediate early, HSV thymidine kinase, early and late SV40,
LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate vector 
and promoter is well within the level of ordinary skill in the art. 

Components of the expression vector may generally include: 1) a neomycin 
phosphotransferase (G418), or hygromycin B phosphotransferase (hyg) gene as a
selection marker, 2) an E. coli origin of replication, 3) a T7 and SP6 phage promoter
sequence, 4) lac operator sequences, 5) the lactose operon repressor gene (lacIq) and 6) a
multiple cloning site linker region. Such an origin of replication (oriC) may be derived
from pUC19 (LTI, Gaithersburg, Md.).

A nucleotide sequence encoding one of the polypeptides SEQ ID NOS:2, 4 and 6
having the appropriate restriction sites is generated, for example, according to the PCR
protocol described in Example 1 hereinafter, using PCR primers having restriction sites
for KpnI (as the 5' primer) and NotI or SacI (as the 3' primer) for DPRP-1, or sites for
HindIII (as the 5' primer) and NotI or BamHI (as the 3' primer) for DPRP-2. The PCR
inserts are gel-purified and digested with compatible restriction enzymes. The insert and
vector are ligated according to standard protocols.
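
The primer design described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the protocol of Example 1: the coding sequence shown is a hypothetical placeholder rather than a DPRP sequence, the two-base 5' clamp and the annealing length are arbitrary choices, and the function names are invented for this sketch. Only the restriction enzyme recognition sequences are standard.

```python
# Standard recognition sequences for the enzymes named in the text.
SITES = {
    "KpnI": "GGTACC",
    "NotI": "GCGGCCGC",
    "SacI": "GAGCTC",
    "HindIII": "AAGCTT",
    "BamHI": "GGATCC",
}

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def design_primers(coding, five_prime_site, three_prime_site, anneal=18, clamp="GC"):
    """Build a (forward, reverse) primer pair carrying restriction sites.

    Each primer is: clamp + restriction site + gene-specific annealing region.
    The clamp and the annealing length are hypothetical defaults.
    """
    coding = coding.upper()
    forward = clamp + SITES[five_prime_site] + coding[:anneal]
    reverse = clamp + SITES[three_prime_site] + reverse_complement(coding[-anneal:])
    return forward, reverse


if __name__ == "__main__":
    # Hypothetical 21-base open reading frame used only to show the mechanics.
    placeholder_orf = "ATGGCTCCGAAAGGTTTCTAA"
    fwd, rev = design_primers(placeholder_orf, "KpnI", "NotI", anneal=6)
    print("forward:", fwd)
    print("reverse:", rev)
```

Placing a different site on each primer, as in the KpnI/NotI and HindIII/BamHI pairings above, lets the digested insert ligate into the vector in a single orientation.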

In a further embodiment, the present invention provides host cells containing the 
above-described constructs. The host cell can be a higher eukaryotic cell such as a
mammalian cell, or a lower eukaryotic cell, such as a yeast cell, or the host cell can be a 
prokaryotic cell, such as a bacterial cell. Introduction of the construct into the host cell 
can be effected by calcium phosphate transfection, DEAE-Dextran mediated
transfection, lipofection or electroporation (Davis, L., Dibner, M., Battey, I., Basic
Methods in Molecular Biology, (1986)). 

Such constructs in host cells are preferably used in a conventional manner to 
produce the gene product encoded by the recombinant sequence. Alternatively, the
polypeptides of the invention can be synthetically produced by conventional peptide
synthesizers or by chemical ligation of suitable fragments thus prepared. 

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other 
cells under the control of appropriate promoters. Cell-free translation systems can also 
be employed to produce such proteins using RNAs derived from the DNA constructs of 
the present invention. Appropriate cloning and expression vectors for use with 
prokaryotic and eukaryotic hosts are described by Sambrook, et al., Molecular Cloning:
A Laboratory Manual, Second Edition, Cold Spring Harbor, N.Y., (1989).

Transcription of the DNA encoding the polypeptides of the present invention by 
higher eukaryotes is increased by inserting an enhancer sequence into the vector.
Enhancers include cis-acting elements of DNA, usually from about 10 to 300 bp, that act
on a promoter to increase its transcription. Examples include the SV40 enhancer on the
late side of the replication origin bp 100 to 270, a cytomegalovirus early promoter 
enhancer, the polyoma enhancer on the late side of the replication origin, and adenovirus 
enhancers. 

Generally, recombinant expression vectors will include origins of replication and 
selectable markers permitting transformation of the host cell, e.g., the
ampicillin-resistance gene of E. coli and S. cerevisiae TRP1 gene, and a promoter derived from a
highly expressed gene to direct transcription of a downstream structural sequence. Such
promoters can be derived from operons encoding glycolytic enzymes, such as
3-phosphoglycerate kinase (PGK), alpha-factor, acid phosphatase, or heat shock proteins
among others. The heterologous structural sequence is assembled in appropriate phase
with translation initiation and termination sequences, and preferably, a leader sequence 
capable of directing secretion of translated protein into the periplasmic space or 
extracellular medium. Optionally, the heterologous sequence can encode a fusion 
protein including an N-terminal identification peptide imparting desired characteristics,
e.g., stabilization or simplified purification of expressed recombinant product. 

Useful expression vectors for bacterial use are constructed by inserting a 
structural DNA sequence encoding a desired protein together with suitable translation 
initiation and termination signals in operable reading phase with a functional promoter.
The vector will comprise one or more phenotypic selectable markers and an origin of
replication to ensure maintenance of the vector and to, if desired, provide amplification
within the host. Suitable prokaryotic hosts for transformation include E. coli, Bacillus
subtilis, Salmonella typhimurium and various species within the genera Pseudomonas,
Streptomyces, and Staphylococcus, although others may also be employed as a matter of
choice.

As a representative but non-limiting example, useful expression vectors for 
bacterial use can comprise a selectable marker and bacterial origin of replication derived 
from commercially available plasmids comprising genetic elements of the well known 
cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example,
pKK223-3 (Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM1 (Promega Biotec,
Madison, Wis., U.S.A.). These pBR322 "backbone" sections are combined with an 
appropriate promoter and the structural sequence to be expressed. 

Following transformation of a suitable host strain and growth of the host strain to 
an appropriate cell density, the selected promoter is induced by appropriate means (e.g.,
temperature shift or chemical induction), and cells are cultured for an additional period.
Cells are typically harvested by centrifugation and then disrupted by physical or
chemical means, with the resulting crude extract being retained for further purification.
Microbial cells employed in expression of proteins can be disrupted by any convenient
method, including freeze-thaw cycling, sonication, mechanical disruption and use of
cell-lysing agents; such methods are well known to those skilled in the art.

Various mammalian cell culture systems can also be employed to express a 
recombinant protein. Examples of mammalian expression systems include the COS-7 
lines of monkey kidney fibroblasts, described by Gluzman, Cell 23:175 (1981). Other
cell lines capable of expressing a compatible vector include, for example, the C127, 3T3,
CHO, HeLa and BHK cell lines. Mammalian expression vectors will generally comprise
an origin of replication, a suitable promoter and enhancer, and also any necessary
ribosome binding sites, polyadenylation site, splice donor and acceptor sites, 
transcriptional termination sequences, and 5' flanking nontranscribed sequences. DNA 
sequences derived from the SV40 splice and polyadenylation sites may be used to
provide required nontranscribed genetic elements.

The polypeptides can be recovered and purified from recombinant cell cultures 
by methods including ammonium sulfate or ethanol precipitation, acid extraction, anion 
or cation exchange chromatography, phosphocellulose chromatography, hydrophobic 
interaction chromatography, affinity chromatography, hydroxylapatite chromatography 
and lectin chromatography. Recovery can be facilitated if the polypeptide is expressed
at the surface of the cells, but such is not a prerequisite. Recovery may also be desirable 
of cleavage products that are cleaved following expression of a longer form of the 
polypeptide. Protein refolding steps as known in this art can be used, as necessary, to 
complete configuration of the mature protein. High performance liquid chromatography 
(HPLC) can be employed for final purification steps.

The polypeptides of the present invention may be purified natural products, or 
produced by recombinant techniques from a prokaryotic or eukaryotic host (for example, 
by bacterial, yeast, higher plant, insect or mammalian cells in culture). Depending upon 
the host employed in a recombinant production procedure, the polypeptides of the 
present invention may be glycosylated or may be non-glycosylated. Polypeptides of the
invention may also include an initial methionine amino acid residue. 

In a preferred embodiment, the proteins of the invention are isolated and purified 
so as to be substantially free of contamination from other proteins. For example, the
proteins of the invention should constitute at least 80% by weight of the total protein
present in a sample, more preferably at least 90%, even more preferably at least 95%,
and most preferably at least 98% by weight of the total protein. 

These proteins may be in the form of a solution in water, another suitable solvent, 
such as dimethyl sulphoxide (DMSO) or ethanol, or a mixture of suitable solvents. 
Examples of mixtures of solvents include 10% (by weight) ethanol in water and 2% (by 
weight) DMSO in water. A solution may further comprise salts, buffering agents,
chaotropic agents, detergents, preservatives and the like. Alternatively, the proteins may
be in the form of a solid, such as a lyophilised powder or a crystalline solid, which may 
also comprise a residual solvent, a salt or the like. 

As used herein, the term "antibodies" includes polyclonal antibodies, 
affinity-purified polyclonal antibodies, monoclonal antibodies, and antigen-binding
fragments, such as F(ab')2 and Fab' proteolytic fragments. Genetically engineered intact
antibodies or fragments, such as chimeric antibodies, Fv fragments, single chain 
antibodies and the like, as well as synthetic antigen-binding peptides and polypeptides, 
are also included. Non-human antibodies may be humanized by grafting non-human 
CDRs onto human framework and constant regions, or by incorporating the entire
non-human variable domains (optionally "cloaking" them with a human-like surface by
replacement of exposed residues, wherein the result is a "veneered" antibody). In some 
instances, humanized antibodies may retain non-human residues within the human 
variable region framework domains to enhance proper binding characteristics. Through
humanizing antibodies, biological half-life may be increased, and the potential for
adverse immune reactions upon administration to humans should be reduced. 

Alternative techniques for generating or selecting antibodies useful herein 
include in vitro exposure of lymphocytes to human prohormone DPRP protein or a 
peptide therefrom, and selection of antibody display libraries in phage or similar vectors
(for instance, through use of immobilized or labeled human DPRP protein or peptide).
Genes encoding polypeptides having potential human DPRP polypeptide binding 
domains can be obtained by screening random peptide libraries displayed on phage 
(phage display) or on bacteria, such as E. coli. Nucleotide sequences encoding such 
polypeptides can be obtained in a number of ways well known in this art.

As would be evident to one of ordinary skill in the art, polyclonal antibodies can
be generated from inoculating a variety of warm-blooded animals, such as horses, cows,
goats, sheep, dogs, chickens, rabbits, mice and rats, with a human DPRP polypeptide or 
a fragment thereof. The immunogenicity of a human prohormone DPRP polypeptide 
may be increased through the use of an adjuvant, such as alum (aluminum hydroxide) or
Freund's complete or incomplete adjuvant, or surface active substances, such as
lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, KLH or 
dinitrophenol. Among adjuvants used in humans, BCG (bacilli Calmette-Guerin) and 
Corynebacterium parvum are especially preferable. Polypeptides useful for 
immunization also include fusion polypeptides, such as fusions of DPRP or a portion
thereof with an immunoglobulin polypeptide or with maltose binding protein. The
polypeptide immunogen may be a full-length molecule or a portion thereof. If the 
polypeptide portion is "hapten-like", such portion may be advantageously joined or 
linked to a macromolecular carrier, such as keyhole limpet hemocyanin (KLH), bovine 
serum albumin (BSA) or tetanus toxoid, for immunization. Antibodies to DPRP may
also be generated using methods that are well known in the art. Such antibodies may
include, but are not limited to, polyclonal, monoclonal, chimeric, and single chain
antibodies, Fab fragments, and fragments produced by a Fab expression library. 
Neutralizing antibodies (i.e., those which block or modify interactions at the active sites) 
are especially preferred for therapeutic use. 

For the production of antibodies, binding proteins, or peptides which bind
specifically to DPRP, libraries of single chain antibodies, Fab fragments, other antibody
fragments, non-antibody protein domains, or peptides may be screened. The libraries
could be generated using phage display, other recombinant DNA methods, or peptide
synthesis (Vaughan, T. J. et al. Nature Biotechnology 14: 309-314 (1996)). Such
libraries would commonly be screened using methods which are well known in the art to
identify sequences which demonstrate specific binding to DPRP. 

It is preferred that the oligopeptides, peptides, or fragments used to induce 
antibodies to DPRP have an amino acid sequence consisting of at least about 5 amino 
acids and, more preferably, of at least about 10 amino acids. It is also preferable that 
these oligopeptides, peptides, or fragments are identical to a portion of the amino acid
sequence of the natural protein. Short stretches of DPRP amino acids may also be fused 
with those of another protein, such as KLH, and antibodies to the chimeric molecule may 
be produced. 

Monoclonal antibodies to DPRP may be prepared using any well known 
technique which provides for the production of antibody molecules by continuous cell
lines in culture. These include, but are not limited to, the hybridoma technique, the 
human B-cell hybridoma technique, and the EBV-hybridoma technique, although 
monoclonal antibodies produced by hybridoma cells may be preferred. 

In addition, techniques developed for the production of "chimeric antibodies", 
such as the splicing of mouse antibody genes to human antibody genes to obtain a
molecule with appropriate antigen specificity and biological activity, can be used, see 
Neuberger, M.S. et al. Nature 312: 604-608 (1984). Alternatively, techniques described 
for the production of single chain antibodies may be adapted, using methods known in 
the art, to produce DPRP-specific single chain antibodies. Antibodies with related 
specificity, but of distinct idiotypic composition, may be generated by chain shuffling
from random combinatorial immunoglobulin libraries (Burton D. R. Proc. Natl. Acad.
Sci. 88: 11120-11123 (1991)).

Antibodies may also be produced by inducing in vivo production in the
lymphocyte population or by screening immunoglobulin libraries or panels of highly
specific binding reagents as disclosed in the literature (Orlandi, R. et al. Proc. Natl.
Acad. Sci. 86: 3833-3837 (1989)).

Antibody fragments which contain specific binding sites for DPRP may also be
generated. For example, such fragments include, but are not limited to, F(ab')2
fragments produced by pepsin digestion of the antibody molecule and Fab fragments
generated by reducing the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab
expression libraries may be constructed to allow rapid and easy identification of
monoclonal Fab fragments with the desired specificity (Huse, W. D. et al. Science 246:
1275-1281 (1989)).

Various immunoassays may be used to identify antibodies having the desired
specificity. Numerous protocols for competitive binding or immunoradiometric assays
using either polyclonal or monoclonal antibodies with established specificities are well
known in the art. Such immunoassays typically involve the measurement of complex
formation between DPRP and its specific antibody. A two-site, monoclonal-based
immunoassay utilizing monoclonal antibodies reactive to two non-interfering DPRP
epitopes is preferred, but a competitive binding assay may also be employed.

As earlier mentioned, the DPRPs can be used in treatment of the Diseases.
Pharmaceutical compositions suitable for use in this aspect of the invention include
compositions wherein the active ingredients are contained in an effective amount to
achieve the intended purpose relating to one of the Diseases. The determination of a
therapeutically effective dose is well within the capability of those skilled in the art and
can be estimated initially either in cell culture assays, e.g. of neoplastic cells, or in
animal models, usually mice, rats, rabbits, dogs, or pigs. An animal model may also be
used to determine the appropriate concentration range and route of administration, which
information is then commonly used to determine useful doses and routes for
administration in humans.

A therapeutically effective dose refers to that amount of active ingredient, e.g. a
DPRP or fragment thereof, antibodies of DPRP, or an agonist, antagonist or inhibitor of
DPRP, which ameliorates particular symptoms or conditions of the Disease. For
example, the amount to be administered may be effective to cleave a desired target
substrate upon contact therewith. Therapeutic efficacy and toxicity may likewise be
determined by standard pharmaceutical procedures in cell cultures or with experimental
animals, such as by calculating the ED50 (the dose therapeutically effective in 50% of
the population) or LD50 (the dose lethal to 50% of the population) statistics. The dose
ratio of toxic to therapeutic effects is the therapeutic index, and it can be expressed as the
LD50/ED50 ratio. Pharmaceutical compositions which exhibit large therapeutic indices
are preferred. The data obtained from cell culture assays and animal studies are used in
formulating a range of dosage for human use. The dosage contained in such
compositions is preferably within a range of circulating concentrations that include the
ED50 with little or no toxicity. The dosage varies within this range depending upon the
dosage form employed, the sensitivity of the patient, and the route of administration.
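
As a minimal illustration of the therapeutic index defined above, the following Python sketch computes the LD50/ED50 ratio. The function name and the dose values are hypothetical and are not taken from this disclosure; they serve only to show the arithmetic, and both doses are assumed to be expressed in the same units.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the LD50/ED50 ratio; both doses must be in the same units."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses in mg/kg, chosen only for illustration.
ti = therapeutic_index(ld50=500.0, ed50=25.0)
print(ti)  # 20.0
```

A larger value indicates a wider margin between the toxic and the therapeutic dose, which is why compositions with large therapeutic indices are preferred.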

An exact dosage will normally be determined by the medical practitioner in light
of factors related to the subject requiring treatment, with dosage and administration
being adjusted to provide a sufficient level of the active moiety or to maintain a desired
effect. Factors to be taken into account include the severity of the disease state, the
general health of the subject, the age, weight, and gender of the subject, diet, time and
frequency of administration, drug combination(s), reaction sensitivities, and
tolerance/response to therapy. Long-acting pharmaceutical compositions may be
administered every 3 to 4 days, every week, or even once every two weeks, depending
on the half-life and clearance rate of the particular formulation.

Yet another aspect of the invention provides polynucleotide molecules having
sequences that are antisense to mRNA transcripts of DPRP-1, DPRP-2 and DPRP-3
polynucleotides. Administration of an antisense polynucleotide molecule can block the
production of the protein encoded by DPRP-1, DPRP-2 or DPRP-3. The techniques for
preparing antisense polynucleotide molecules and administering such molecules are
known in the art. For example, antisense polynucleotide molecules can be encapsulated
into liposomes for fusion with cells.

In particular, the expression of DPRP-1, DPRP-2 and DPRP-3 in specialized
epithelial cells, immune cells (lymphocytes and B cells), astrocytic tumors, and in
various hormone sensitive cancers provides evidence of a potential role in the
pathophysiology of cancer, metaplasia and metastasis. Therefore in a further aspect, the
invention relates to diagnostic assays for detecting diseases associated with inappropriate
DPRP activity or expression levels. Antibodies that specifically bind DPRP may be used
for the diagnosis of disorders characterized by expression of DPRP, or in assays to
monitor patients being treated with DPRP or with agonists or antagonists (inhibitors) of
DPRP. Antibodies useful for diagnostic purposes may be prepared in the same manner
as those described above for therapeutics. Diagnostic assays for DPRP include methods
that utilize the antibody and a label to detect DPRP in human body fluids or in extracts
of cells or tissues. The antibodies may be used with or without modification, and they
may be labeled by covalent or non-covalent joining with a reporter molecule. A wide
variety of reporter molecules are known in the art. Recombinant DPRP proteins that
have been modified so as to be catalytically inactive can also be used as dominant
negative inhibitors. Such modifications include, for example, mutation of the active site.

A variety of protocols for measuring DPRP, including ELISAs, RIAs and FACS,
are known in the art and provide a basis for diagnosing altered or abnormal levels of
DPRP expression. Normal or standard values for DPRP expression are established by
combining body fluids or cell extracts taken from normal mammalian subjects,
preferably human, with antibody to DPRP under conditions suitable for complex
formation. The method for detecting DPRP in a biological sample would comprise the
steps of: a) providing a biological sample; b) combining the biological sample and an
anti-DPRP antibody under conditions which are suitable for complex formation to occur
between DPRP and the antibody; and c) detecting complex formation between DPRP
and the antibody, thereby establishing the presence of DPRP in the biological sample.
The amount of complex formation then may be quantified by various methods,
preferably by photometric means. Quantities of DPRP expressed in subject, control, and
disease samples from biopsied tissues are compared with the standard values. Deviation
between standard and subject values establishes the parameters for diagnosing disease.
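
The comparison of a subject value with standard values can be sketched as follows. This is only one possible way to express "deviation"; the z-score approach, the cutoff of 2.0, the function names and all sample readings are assumptions made for illustration and do not appear in this disclosure.

```python
from statistics import mean, stdev


def deviation_from_standard(normal_values, subject_value):
    """Return the subject's z-score relative to the normal (standard) samples."""
    if len(normal_values) < 2:
        raise ValueError("at least two normal samples are required")
    mu = mean(normal_values)
    sd = stdev(normal_values)  # sample standard deviation
    if sd == 0:
        raise ValueError("normal samples show no variation")
    return (subject_value - mu) / sd


def is_abnormal(normal_values, subject_value, z_cutoff=2.0):
    """Flag a subject whose value deviates from the standard by more than z_cutoff."""
    return abs(deviation_from_standard(normal_values, subject_value)) > z_cutoff


# Hypothetical photometric readings (arbitrary units) from normal subjects.
normal = [10.0, 12.0, 11.0, 9.0, 13.0]
print(is_abnormal(normal, 20.0))  # True
print(is_abnormal(normal, 11.5))  # False
```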

In another embodiment of the invention, the polynucleotides encoding DPRP are
used for diagnostic purposes, which polynucleotides may include oligonucleotide
sequences, complementary RNA and DNA molecules, and PNAs. These
polynucleotides may be used to detect and quantitate gene expression in biopsied tissues
in which expression of DPRP may be correlated with one of the Diseases. The
diagnostic assay may be used to distinguish between absence, presence, and excess
expression of DPRP and to monitor regulation of DPRP levels during therapeutic
intervention. Moreover, pharmacogenomic single nucleotide polymorphism (SNP)
analysis of the DPRP genes can be used as a method to screen for mutations that indicate
predisposition to disease or modified response to drugs.

DPRP polynucleotide and polypeptide sequences, fragments thereof, antibodies
of DPRPs, and agonists, antagonists or inhibitors of DPRPs can be used as discovery
tools to identify molecular recognition events and therefore proteins, polypeptides and
peptides that interact with DPRP proteins. A specific example is phage display peptide
libraries, where greater than 10^8 peptide sequences can be screened in a single round of
panning. Such methods as well as others are known within the art and can be utilized to
identify compounds that inhibit or enhance DPRP-1, DPRP-2 or DPRP-3 activity.
Coupled links represent functional interactions such as complexes or pathways, and
proteins that interact with DPRPs can be identified by a yeast two-hybrid system,
proteomics (differential 2D gel analysis and mass spectrometry) and genomics
(differential gene expression by microarray or serial analysis of gene expression, SAGE).
Proteins identified as functionally linked to DPRPs and the process of interaction form
the basis of methods of screening for inhibitors, agonists, antagonists and modulators
of these DPRP-protein interactions.

The term "antagonist," as it is used herein, refers to an inhibitor molecule which,
when bound to DPRP, decreases the amount or the duration of the effect of the
biological or immunological activity of DPRP, e.g. decreasing the enzymatic activity of
the peptidase to cleave the N-terminal dipeptide. Antagonists may include proteins,
nucleic acids, carbohydrates, antibodies, or any other molecules which decrease the
effect of DPRP; for example, they may include small molecules and organic compounds
that bind to and inactivate DPRPs by a competitive or non-competitive type mechanism.
Specific examples of DPRP tetrapeptide peptidic enzyme activity inhibitors are
described in Examples 6 and 7. Inhibitors can be, for example, inhibitors of the DPRP
protease activity, or alternatively inhibitors of the binding activity of the DPRP to
proteins with which they interact. Specific examples of such inhibitors can include, for
example, anti-DPRP antibodies, peptides, protein fragments, or small peptidyl protease
inhibitors, or small non-peptide, organic molecule inhibitors which are formulated in a
medium that allows introduction into the desired cell type. Alternatively, such inhibitors
can be attached to targeting ligands for introduction by cell-mediated endocytosis and
other receptor mediated events. Such methods are described further below and can be
practiced by those skilled in the art given the DPRP nucleotide and amino acid
sequences described herein.

A further use for DPRPs is for the screening of potential antagonists for use as
therapeutic agents, for example, for inhibiting binding to DPRP, as well as for screening
for agonists. DPRP, its immunogenic fragments, or oligopeptides thereof can be used
for screening libraries of compounds which are prospective agonists or antagonists in
any of a variety of drug screening techniques. The fragment employed in such screening
may be free in solution, affixed to a solid support, borne on a cell surface, or located
intracellularly. The formation of binding complexes between DPRP and the agent being
tested is then measured. Other assays to discover antagonists that will inhibit DPRP are
apparent from the disclosures of U.S. Patents Nos. 6,011,155, 6,107,317, 6,110,949,
6,124,305 and 6,166,063, which describe inhibitors of DPPIV. Another worthwhile use
of these DPRPs is the screening of inhibitors of DPPIV to show that they will not have
undesired side effects by also inhibiting one or more of the DPRPs.

WO 02/31134 PCT/US01/31874

A method provided for screening a library of small molecules to identify a
molecule which binds DPRP generally comprises: a) providing a library of small
molecules; b) combining the library of small molecules with the polypeptide of either
SEQ ID NOS: 1, 3 or 5, or with a fragment thereof, under conditions which are suitable
for complex formation; and c) detecting complex formation, wherein the presence of
such a complex identifies a small molecule which binds DPRP.

One method for identifying an antagonist comprises delivering a small molecule
which binds DPRP into extracts from cells transformed with a vector expressing DPRP
along with a chromogenic substrate (e.g. Ala-Pro-AFC or Ala-Pro-AMC) under
conditions where cleavage would normally occur, and then assaying for inhibition of
cleavage by the enzyme by monitoring changes in fluorescence, or UV light absorption,
by spectrophotometry to identify molecules that inhibit cleavage. A reduced rate of
reaction or total amount of fluorescence or UV light absorption, in the presence of the
molecule, establishes that the small molecule is an antagonist which reduces DPRP
catalytic/enzymatic activity. Once such molecules are identified, they may be
administered to reduce or inhibit cleaving by a DPRP.
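
The "reduced rate of reaction" criterion above can be sketched numerically as follows. This is a minimal illustration under stated assumptions: the rate is taken as the least-squares slope of signal against time, and the function names and all readings are hypothetical rather than data from this disclosure.

```python
def initial_rate(times, signal):
    """Least-squares slope of signal versus time (signal units per time unit)."""
    n = len(times)
    if n < 2 or n != len(signal):
        raise ValueError("times and signal must have equal length of at least 2")
    mean_t = sum(times) / n
    mean_s = sum(signal) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    if sxx == 0:
        raise ValueError("time points must not all be identical")
    sxy = sum((t - mean_t) * (s - mean_s) for t, s in zip(times, signal))
    return sxy / sxx


def percent_inhibition(rate_control, rate_test):
    """Percentage reduction of the reaction rate relative to the untreated control."""
    if rate_control <= 0:
        raise ValueError("control rate must be positive")
    return 100.0 * (1.0 - rate_test / rate_control)


# Hypothetical fluorescence readings (arbitrary units) at 0-20 minutes.
minutes = [0, 5, 10, 15, 20]
control = [100, 200, 300, 400, 500]   # no test molecule
treated = [100, 125, 150, 175, 200]   # with test molecule
print(percent_inhibition(initial_rate(minutes, control),
                         initial_rate(minutes, treated)))  # 75.0
```

A positive percentage indicates a candidate antagonist; a value near zero indicates no effect on cleavage under the assumed conditions.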

The term "agonist," as used herein, refers to a molecule which, when bound to
DPRP, increases or prolongs the duration of the effect of DPRP. Agonists may include
proteins, nucleic acids, carbohydrates, or any other molecules that bind to and modulate
the effect of DPRP. Although it is less likely that small molecules will prove to be
effective DPRP agonists, a method for identifying such a small molecule, which binds
DPRP as an agonist, comprises delivering a chromogenic form of a small molecule that
binds DPRP into cells transformed with a vector expressing DPRP and assaying for
fluorescence or UV light absorption changes by spectrophotometry. An increased
amount of UV absorption or fluorescence would establish that the small molecule is an
agonist that increases DPRP activity.

Another technique for drug screening which may be used provides for high
throughput screening of compounds having suitable binding affinity to the protein of
interest as described in published PCT application WO84/03564. In this method, large
numbers of different small test compounds are synthesized on a solid substrate, such as
plastic pins or some other surface. The test compounds are reacted with DPRP, or with
fragments thereof, and then washed. Bound DPRP is then detected by methods well
known in the art. Purified DPRP can also be coated directly onto plates for use in the
aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies
can be used to capture the peptide and immobilize it on a solid support.

In another embodiment, one may use competitive drug screening assays in which
neutralizing antibodies capable of binding DPRP specifically compete with a test
compound for binding DPRP. In this manner, antibodies can be used to detect the
presence of any peptide that shares one or more antigenic determinants with DPRP.

As indicated above, by investigating the binding sites, ligands may be designed
that, for example, have more interactions with DPRP than do its natural ligands. Such
antagonist ligands will bind to DPRP with higher affinity and so function as competitive
ligands. Alternatively, synthetic or recombinant proteins homologous or analogous to
the ligand binding site of native DPRP may be designed, as may other molecules having
high affinity for DPRP. Such molecules should also be capable of displacing DPRP and
provide a protective effect.

As indicated above, the knowledge of the structures of DPRP enables synthetic
binding site homologues and analogues to be designed. Such molecules will facilitate
greatly the use of the binding properties to target potential therapeutic agents, and they
may also be used to screen potential therapeutic agents. Furthermore, they may be used
as immunogens in the production of monoclonal antibodies, which antibodies may
themselves be used in diagnosis and/or therapy as described hereinbefore.

Given the ubiquitous expression of several members of the prolyl oligopeptidase
S9B family, cell lines in which DPPIV, DPRP-1, DPRP-2, DPRP-3, FAP and DPP VI
are subjected to targeted gene disruption to establish the null phenotype will be of great
value in screening for selective and potent compounds. Accordingly, the invention
provides such cell lines engineered with Lox-Neo IRES tk cassette and GFP-IRES-Neo
Knock-in/out cassette DNA elements for constructing somatic gene targeting vectors.

Example 1

Cloning and Expression of DPRP genes Using the Mammalian Expression System

DNA fragments encoding the full-length polypeptide DPRP-1 were amplified
using PCR oligonucleotide primers corresponding to the 5' and 3' sequences of the gene,
i.e. SEQ ID NO:45 and NO:46. In addition, DNA fragments encoding the full-length
polypeptide DPRP-2 were amplified using PCR oligonucleotide primers corresponding
to the 5' and 3' sequences of that gene, i.e. SEQ ID NO:50 and NO:51. Furthermore,
DNA fragments encoding the full-length polypeptide DPRP-3 were amplified using PCR
oligonucleotide primers corresponding to the 5' and 3' sequences of that gene, i.e. SEQ
ID NO:55 and NO:56.

The three amplified sequences were respectively isolated from a 0.7% agarose
gel using a commercially available kit (GFX PCR DNA and Gel Band Purification Kit,
Amersham Pharmacia Biotech Inc., Piscataway NJ, USA). The fragments were
ligated into cloning vector pGEM-7Zf(-) (Promega Corporation, Madison WI, USA)
and sequenced. The corresponding cloning constructs were respectively designated
pGEM7-DPRP1, pGEM7-DPRP2 and pGEM7-DPRP3. The DNA fragments encoding
the truncated DPRP-1 or DPRP-2 or DPRP-3 were amplified using pGEM7-DPRP1 or
pGEM7-DPRP2 or pGEM7-DPRP3 as a template and PCR oligonucleotide primers.
SEQ ID NO:45 and NO:47 were used for DPRP-1, SEQ ID NO:50 and NO:52 were used
for DPRP-2 and SEQ ID NO:57 and NO:58 for DPRP-3. The amplified sequences were
again isolated from a 0.7% agarose gel using the same purification kits and subcloned
into pGEM-7Zf(-). The resulting constructs were designated pGEM7-DPRP1f, pGEM7-
[remainder of the construct names illegible in source].

To make the DPRP-1 mammalian expression construct, pGEM7-DPRP1 was
digested with the restriction enzymes KpnI and NotI to release the full-length DPRP-1
gene. The DNA fragment carrying the DPRP-1 gene was gel band purified using the
above kit and then inserted into expression vector pcDNA3 (Invitrogen, Carlsbad CA,
USA) to make the native DPRP-1 expression construct, which was designated pcDNA-
DPRP1. pGEM7-DPRP1f was digested with the restriction enzymes XbaI and HindIII
to release the truncated DPRP-1f gene. The DNA fragment carrying the DPRP-1f gene
was gel band purified using the above kit and then inserted into expression vector
pcDNA3.1(-)/myc-His A (Invitrogen, Carlsbad CA, USA) to make the tagged DPRP-1
expression construct pcDNA-MycHis-DPRP1.
To make the DPRP-2 mammalian expression construct, pGEM7-DPRP2 was
digested with the restriction enzymes HindIII and BamHI to release the full-length
DPRP-2 gene. The DNA fragment carrying the DPRP-2 gene was gel band purified
using the above kit and then inserted into expression vector pcDNA3 (Invitrogen,
Carlsbad CA, USA) to make the native DPRP-2 expression construct, which was
designated pcDNA-DPRP2. pGEM7-DPRP2f was digested with the restriction enzymes
EcoRI and BamHI to release the truncated DPRP-2f gene. The DNA fragment carrying
the DPRP-2f gene was gel band purified using the above kit and then inserted into
expression vector pcDNA3.1(-)/myc-His B (Invitrogen, Carlsbad CA, USA) to make the
tagged DPRP-2 expression construct designated pcDNA-MycHis-DPRP2.

To make the DPRP-3 mammalian expression construct, pGEM7-DPRP3 was
digested with the restriction enzymes EcoRI and XhoI to release the full-length DPRP-3
gene. The DNA fragment carrying the DPRP-3 gene was gel band purified using the
[...] to make the native DPRP-3 expression construct designated pcDNA-DPRP3.
[...] truncated DPRP-3f gene. The DNA fragment carrying the DPRP-3f gene was
[...] purified using the above kit and then inserted into expression vector
pcDNA3.1(-)/myc-His B (Invitrogen, Carlsbad CA, USA) to make the tagged DPRP-3
expression construct pcDNA-MycHis-DPRP3.

Example 2

[Title illegible in source]

Quantitative PCR analysis was carried out to examine the levels of expression of
the mRNAs for the polypeptides of the present invention in human tissues. RT-PCR was
carried out on a number of human cell lines including but not limited to prostate
cancer cells (LNCaP, PC3, [...]), the [...] line ([...]) and [...] cells (breast cancer).
Bands of the expected sizes for DPRP-1, DPRP-2 and DPRP-3 were observed in the
various cancer cell lines, with FAP also being expressed at very low levels.

Northern blot analysis was performed with [...] RNA isolated [...] and DPRP
probes. Specifically, a human Multiple Tissue [...] was probed with [...] that had been
radioactively labeled [...] 32P-dCTP (A. P. Feinberg et al., Anal. Biochem., 132, 6
(1983)). Hybridization was performed at [...] overnight in ExpressHyb hybridization
solution (Clontech, Palo Alto, Calif.). The blots were first washed at room temperature
in 2 times SSC and 0.05% SDS, and then washed at 60°C ([...]) [...] (DPRP-3) in
0.1 [...].

[...] showed expression of DPRP-1 in several tissues, with the most abundant
[...] in testis, prostate, muscle and brain. Testis showed 3 transcripts [...] kb in
length. The shorter mRNA species was very [...]. DPRP-2 was ubiquitously
expressed in every tissue, with highest levels in liver and muscle and a predominant
[...] 5 kb. DPRP-3 expression was limited to brain and pancreas. Further [...] for the
three proteases in [...] ([...] cortex, medulla, spinal cord, occipital lobe, frontal lobe,
temporal lobe and putamen). DPRP-1 was expressed in all regions with low levels
present in the spinal cord, while DPRP-2 was expressed in all brain regions tested.

Oligonucleotide primers SEQ ID NO:48 and NO:49 were used for DPRP-1
quantitative PCR, whereas oligonucleotide primers SEQ ID NO:53 and NO:54 were
used for DPRP-2 quantitative PCR. Human Multiple Tissue cDNA (MTC™) Panel I
and Panel II (Clontech, Palo Alto CA, USA) were used as normalized cDNA templates.
0.5 ng of each cDNA was used in a 25 µl PCR reaction, with each primer at a final
concentration of 300 nM. The PCR reaction was performed using a SYBR Green PCR
Core Reagents Kit (Applied Biosystems, Foster City CA, USA) and detected with an
Applied Biosystems GeneAmp 5700 sequence detection system. The manufacturer's
recommended thermal cycling parameters, e.g. 50°C for 2 min, 95°C for 10 min followed
by 40 cycles of 95°C for 15 sec and 60°C for 1 min, were used. Data obtained show
relatively high rates of expression for both DPRP-1 and DPRP-2 in the pancreas, ovary
and testis, and a particularly high rate for DPRP-2 in the liver.
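
The reaction set-up stated above can be checked with a short calculation. The following Python sketch encodes the cycling parameters and primer concentration given in this Example; the class and function names are hypothetical, and the total time counts programmed hold times only, ignoring instrument ramping.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    temp_c: float   # block temperature in degrees Celsius
    seconds: int    # programmed hold time


# Cycling parameters as stated in the text.
INITIAL_HOLDS = [Step(50.0, 120), Step(95.0, 600)]
CYCLE = [Step(95.0, 15), Step(60.0, 60)]
N_CYCLES = 40


def total_hold_seconds(holds, cycle, n_cycles):
    """Sum of programmed hold times, excluding temperature ramping."""
    return sum(s.seconds for s in holds) + n_cycles * sum(s.seconds for s in cycle)


def primer_pmol(final_nm, reaction_ul):
    """Amount of primer in pmol: nM x µl gives fmol, so divide by 1000."""
    return final_nm * reaction_ul / 1000.0


print(total_hold_seconds(INITIAL_HOLDS, CYCLE, N_CYCLES))  # 3720
print(primer_pmol(300.0, 25.0))                            # 7.5
```

Under these parameters each 25 µl reaction contains 7.5 pmol of each primer, and the programmed holds total 62 minutes.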

Example 3 - Production of DPRP Polyclonal Antibodies and Western Blotting

The amino acid sequence deduced from the cDNA encoding DPRP-1 was
analyzed using DNASTAR software (DNASTAR, Inc.) to determine regions of high
immunogenicity, and a corresponding oligopeptide was synthesized and used to raise
anti-DPRP-1 antibodies. The procedure was repeated for DPRP-2 and DPRP-3. The
selection of appropriate peptide sequences and the techniques for antibody production
are methods well known to those of skill in the art. Selection of appropriate epitopes,
such as those near the C-terminus or in hydrophilic regions, is well known in this art.

Typically, oligopeptides that are about 15 to 20 residues in length, e.g. SEQ ID
NO:59 for DPRP-1, SEQ ID NO:60 for DPRP-2 and SEQ ID NO:61 for DPRP-3, were
synthesized using an Applied Biosystems Peptide Synthesizer Model 431A.
Fmoc-chemistry was used and the 19- or 15-residue peptides were respectively coupled
to keyhole limpet hemocyanin (KLH, Sigma, St. Louis, Mo.) by reaction with
N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS). Rabbits were immunized
with the oligopeptide-KLH complex in complete Freund's adjuvant. The resulting
antisera were tested for antipeptide activity, e.g., by binding the peptide to plastic,
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with
radioiodinated, goat anti-rabbit IgG.

Western blotting was performed using normal human protein samples (Protein
Medley) obtained from Clontech (about 36 µg of total proteins). Proteins were
fractionated through 10% SDS-polyacrylamide gels, and transferred to 0.45 µm
nitrocellulose membranes. Membranes were blocked in Tris-buffered saline (TBS) with
0.05% Tween 20 and 1% BSA. Anti-DPRP [...] specific antibodies were used [...] in
Tris-buffered saline with 0.05% Tween 20 (TBST) and [...] for AP until the bands of
interest had reached the desired intensity. DPRP-1 and DPRP-2 proteins were detected
in brain, muscles, kidney, prostate, [...]. DPRP-1 and DPRP-2 were synthesized as
approximately 101 kDa and 100 kDa [...], which are in good agreement with the
molecular masses estimated from their primary structure as shown in Table 3.

[...] Sci. 78:3824-3828 (1981).

Several additional bands of similar molecular weight were observed. These are
thought to be due to the presence of post-translational glycosylation of the proteins.
Table 3 also shows the number of potential N-glycosylation sites for the DPRP proteins.
The presence of glycosylated and unglycosylated forms of the proteins was evaluated
using tunicamycin, an inhibitor of the oligosaccharide synthesis. It is evident that the
smaller forms were unglycosylated forms. The [...] analysis) and protein quantity
(Western analysis) for DPRP [...]

Example 4

Immunohistochemical localization of DPRP proteins in human tissues

Four-micron sections were prepared from a number of different formalin-fixed,
paraffin-embedded human tissues. Tissue sections were deparaffinized through 4
immersions in xylenes for 5 minutes, followed by a graded alcohol series to distilled
water. Steam heat induced epitope recovery (SHIER) was used with several different
SHIER solutions with and without enzyme digestion of tissue in two different
concentrations (Ladner et al., Cancer Res. 60: 3493-3503 (2000)). The treatments and
antibody dilutions employed are outlined below.

1. Blocking Reagent for 15 minutes (Normal Goat Serum)
2. Primary Antibody for 25 min, 60 min or overnight incubation
3. Secondary Antibody for 25 minutes (Biotinylated Goat-anti-rabbit IgG)
4. Endogenous Peroxidase Blocking for 3 x 1.5 minutes
5. ABC (avidin-biotin complex) / Horseradish Peroxidase for 25 minutes
6. DAB Chromogen for 3 x 5 minutes (Brown reaction product)
7. Light Hematoxylin Counter Stain for 1 minute

Positive controls were run to assure the detection chemistries and antigen
pretreatments were working appropriately. Rabbit IgG was run as a negative control.
An avidin-biotin based tissue staining system was used for the detection of the DPRP-1
antibody. Horseradish peroxidase was used as a reporter enzyme with DAB as chromogen.
After staining, slides were dehydrated through an alcohol series to absolute ethanol
followed by xylene rinses. Slides were permanently coverslipped with glass coverslips
and Permount. Digital images of representative staining, where positive staining was
indicated by a dark brown chromogen (DAB-HRP reaction product), were captured
using a video camera from Olympus. Hematoxylin counterstain provides a blue nuclear
stain to assess cell and tissue morphology.

DPRP-1 rabbit polyclonal antibody labels formalin-fixed, paraffin-embedded
human tissues, including normal testis, prostate glands, endometrial glands, tonsils and
pancreas. It was also present in endothelial cells of normal ovary, bladder and kidney.
Staining was localized in the cytoplasm in epithelial and some stromal cells such as
fibroblasts, endothelial cells and lymphocytes. Interestingly, in normal testis tested with
DPRP-1 antibodies, there was distinctive expression in Leydig cells and multinucleated
macrophages found in interstitial tissue, which is the space surrounding the seminiferous
tubules. Tonsil B cells were stained with DPRP-1 antibody.

Example 5

Mammalian and Insect Cell Expression of DPRP Proteins and Purification

Plasmid DNA of pcDNA-DPRP1, pcDNA-MycHis-DPRP1, pcDNA-DPRP2 or
pcDNA-MycHis-DPRP2 was transfected into PEAK (EdgeBioSystems, Gaithersburg
MD, USA) or COS-1 (ATCC CRL-1650) cells using the LipofectAmine (Life Technologies,
Gaithersburg MD, USA) method recommended by the manufacturer. Transfected cells
were maintained in DMEM with 5% FBS at 37°C with 5% CO2 for 48 hours. Cells were
then collected and used for recombinant protein extraction. Cells were harvested 48
hours after transfection, homogenized and then spun at 18,000 x g for 40 min. The
supernatants were collected as cytosolic fractions. This fraction was loaded on a TALON
spin column (Clontech), and His-tagged proteins were eluted with 50 mM PBS, 150 mM
imidazole, pH 7. Recombinant proteins were then detected by western blotting with
anti-myc antibody and visualized using a ProtoBlot II AP system (Promega).
Recombinant affinity purified fusions of DPRP-1 and DPRP-2 were detected by
western blot, and DPRP-1 and DPRP-2 were synthesized as 112 kDa and 109 kDa forms
as predicted.

Naturally occurring or recombinant DPRP proteins were substantially purified by
immunoaffinity chromatography using antibodies specific for DPRP-1, DPRP-2 or
DPRP-3. An immunoaffinity column was constructed by covalently coupling DPRP
antibodies to an activated chromatographic resin, such as CNBr-activated Sepharose
(Pharmacia & Upjohn). After the coupling, the resin was blocked and washed according
to the manufacturer's instructions.

Media or cell extracts containing DPRP proteins were passed over the
immunoaffinity column, and the column was washed under conditions that allow the
preferential absorbance of DPRPs (e.g., high ionic strength buffers in the presence of
detergent). The column was eluted under conditions that disrupt antibody/DPRP binding
(e.g., a buffer of pH 2-3 or a high concentration of a chaotrope, such as urea or
thiocyanate ion), and purified DPRP was collected.

Example 6

Enzymatic Activity of DPRP proteins and Methods of Screening for Inhibitors

The kinetic properties of recombinant DPRP-1 and DPRP-2 were determined in a
continuous fluorimetric assay. Buffer, pH and temperature dependence optimization led
to the following assay conditions: Enzyme assays were performed in 50 mM PBS, pH 7.4.
50 µl (50 µg/ml) of purified enzymes were mixed with 1 µl of different concentrations of
Ala-Pro-AMC (Enzyme Systems). Plates were then incubated at 37°C for 30 min, and
fluorescence was detected using a Wallac 1420 Fluorimeter with λex 40355 [sic] and λem 535.
The Km values of DPRP-1 and DPRP-2 were similar (208 and 161 respectively).
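
The text does not state how the Km values were derived from the fluorescence data. As a hedged illustration of one common approach, the Python sketch below estimates Km and Vmax by a double-reciprocal (Lineweaver-Burk) linear fit. The function names, the substrate concentrations and the kinetic constants used to generate the synthetic rates are all hypothetical; in practice a nonlinear fit of the Michaelis-Menten equation is generally preferred.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares fit; returns (slope, intercept)."""
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("xs and ys must have equal length of at least 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0:
        raise ValueError("x values must not all be identical")
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sxx
    return slope, mean_y - slope * mean_x


def km_vmax_lineweaver_burk(substrate, rates):
    """Estimate (Km, Vmax) from 1/v = (Km/Vmax)(1/[S]) + 1/Vmax."""
    inv_s = [1.0 / s for s in substrate]
    inv_v = [1.0 / v for v in rates]
    slope, intercept = linear_fit(inv_s, inv_v)
    vmax = 1.0 / intercept
    return slope * vmax, vmax


# Synthetic, noise-free rates generated from assumed constants (not source data).
true_km, true_vmax = 200.0, 100.0
substrate = [25.0, 50.0, 100.0, 200.0, 400.0, 800.0]
rates = [true_vmax * s / (true_km + s) for s in substrate]

km, vmax = km_vmax_lineweaver_burk(substrate, rates)
print(round(km, 3), round(vmax, 3))  # 200.0 100.0
```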

Further biochemical characterization reveals that DPRP-1 and DPRP-2 have 
similar profiles to DPPIV. The two purified proteases and DPPIV were preincubated 
with inhibitors at room temperature for 30 min. Substrate, Ala-Pro-AMC (100 µM), was
then added, and the fluorescence intensity was recorded as 60 readings during a 60 min
period. The irreversible serine protease inhibitor AEBSF was the only inhibitor tested
that showed strong inhibition of all three enzymes (Table 5). This confirms the structural
and domain analysis prediction that these proteins belong to the serine protease
superfamily.

Table 5. Inhibition of DPRP-1 and DPRP-2 by Protease Inhibitors

Inhibitor           Inhibitor Property    Concentration  Residual activity (% of control)
                                                         DPRP-1   DPRP-2   DPPIV
AEBSF               serine, irreversible  5 mM           29.6     23.9     21.1
Aprotinin           serine, reversible    5 µg/ml        77.5     63.2     80.2
Pepstatin           aspartic, reversible  2 µg/ml        97.3     95.0     93.5
DTT                 cysteine              2 mM           100.1    94.8     98.3
β-Mercaptoethanol   cysteine              100 mM         93.2     84.0     98.0
EDTA                metallo, reversible   2 mM           91.5     86.0     93.5
Leupeptin           serine, reversible    50 µg/ml       91.1     90.4     90.7


In addition to Ala-Pro-AMC, additional substrates tested also confirmed that
DPRP-1 and DPRP-2 are dipeptidyl peptidases. The data were derived by determining
the fluorescence change following a 30-minute incubation of the substrates (125 µM)
with enzymes, expressed as a percentage change in fluorescence. Ala-Pro-AMC and
Gly-Pro-AMC were the only good substrates among those tested.

Table 6. DPRP-1 and DPRP-2 are dipeptidyl peptidases.

Substrate          % Change in Fluorescence at 30 minutes
                   DPRP-1   DPRP-2   DPPIV
Ala-Pro-AMC        239.0    127.5    379.0
Gly-Pro-AMC        341.5    205.0    444.0
Ala-Pro-pNA        45.5     44.0     29.5
Pro-pNA            -1       -2.5     0.0
Gly-Arg-pNA        -4.5     -0.5     0.0
Lys-Ala-pNA        2.5      0.5      0.5
Ala-Phe-Pro-pNA    -4       -0.5     2.0

Additional natural and non-natural amino acid di-, tri- and tetra-peptides were tested in
order to find an optimal substrate for testing each of the DPRP proteins that will also
show reduced activity when incubated with DPPIV.

The enzyme assay method described here is one of a number of methods that can
be utilized to screen for peptide and non-peptide inhibitors of the DPRP enzymes.

Libraries of tetrapeptide inhibitors were tested to discover inhibitors of enzyme activity.
Candidate inhibitors were prepared as 10-20 mM stock solutions in DMSO and stored at
20°C. Dilutions were made in assay buffer. Inhibition was determined by comparing
the change in fluorescence of the inhibited enzyme to the change in fluorescence of the
control (vehicle) enzyme. 100 - (fl units of sample / fl units of control x 100) gives the
percent inhibition value. The percent inhibition and the inhibitor concentration at which
the enzyme was 50% inhibited (IC50) were ascertained by plotting percent inhibition vs.
inhibitor concentration on the log scale. As shown in Figure 3, several tetrapeptide
amides inhibited enzyme activity, wherein data are expressed as the % of activity in the
presence of vehicle (0.02% DMSO) alone. Compounds were added at 1 mM. Most
interesting was the apparent differential activity of some tetrapeptides for DPRP-1 and
DPRP-2 compared to DPPIV. While all three enzymes were inhibited by Peptide-1,
only DPRP-1 and DPRP-2 were significantly inhibited by Peptide-4 and Peptide-5. This
demonstrates that selective inhibition of the purified enzymes is achievable.
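
The percent inhibition formula and the IC50 read-out described above can be sketched as follows. This is an illustrative sketch only: the function names are hypothetical, the fluorescence values are invented, and the IC50 is taken by linear interpolation on the log-concentration axis between the two points that bracket 50% inhibition, which is one simple way of reading a value off such a plot and is an assumption rather than the method stated in this example.

```python
import math


def percent_inhibition(sample_fl, control_fl):
    """100 - (fl units of sample / fl units of control x 100)."""
    return 100.0 - (sample_fl / control_fl * 100.0)


def ic50_log_interpolation(concentrations, inhibitions):
    """Interpolate on log10(concentration) between the points bracketing 50%.

    Returns None when the data never cross 50% inhibition.
    """
    points = sorted(zip(concentrations, inhibitions))
    for (c_lo, i_lo), (c_hi, i_hi) in zip(points, points[1:]):
        if i_lo <= 50.0 <= i_hi and i_hi != i_lo:
            frac = (50.0 - i_lo) / (i_hi - i_lo)
            log_lo, log_hi = math.log10(c_lo), math.log10(c_hi)
            return 10.0 ** (log_lo + frac * (log_hi - log_lo))
    return None


# Hypothetical changes in fluorescence (arbitrary units), not measured data.
control_fl = 1000.0
concentrations = [0.1, 1.0, 10.0, 100.0]      # arbitrary concentration units
sample_fl = [900.0, 750.0, 250.0, 100.0]

inhibitions = [percent_inhibition(fl, control_fl) for fl in sample_fl]
ic50 = ic50_log_interpolation(concentrations, inhibitions)
print(inhibitions, ic50)
```

A four-parameter logistic fit is the more common choice when enough concentrations have been tested; interpolation is shown here only because it mirrors reading the value from the plot.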

The assay described in this example can also be used to screen additional 
synthetic or naturally occurring compound libraries, including macromolecules, for 
agents that either inhibit or enhance DPRP activity. The DPRP-1 and DPRP-2 
polypeptides to be used in the assay can be obtained by, for example, in vitro translation, 
recombinant expression (see Example 5) or biochemical procedures. Methods other than 
those described here can also be used to screen and identify compounds that inhibit 
DPRP-1, DPRP-2 or DPRP-3, which methods can include, for example, binding assays 
such as ELISAs and RIAs. 

Example 7

Effect of DPRP Inhibitors on Proliferation of Human Cancer Cells In Vitro

In an attempt to assess the effect that several inhibitors of DPRP-1 and DPRP-2
activity may have on the proliferation of human cancer cells, LNCaP, PC3 and Du145 cells,
the mouse testis line MLTC-1 and MDA-MB231 breast cancer cells were plated (10 per
well) in 96-well tissue culture plates and allowed to grow and attach for 24 hours at 37°C
in a CO2 incubator. Compounds at various dilutions (final dilutions: 0.1 nM - 10 µM)
were then added to the wells for various incubation periods from 24 hours to 96 hours,
with fresh compound being replaced each day. Addition of the diluent DMSO alone
served as the control. Following incubation with these compounds in triplicate,
proliferation of the cells was determined using an XTT cell proliferation assay (Roche
1-465-015). The plates were read at 490 and 650 nm 5 hours after the XTT mix was added.
An increase in cell proliferation was observed with three of the inhibitors at
concentrations equal to 0.1, 1, 10 and 100 x IC50, and the results are shown in FIGS. 4A,
4B and 4C for PC3 cells.
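
As a hedged illustration of how dual-wavelength plate readings of this kind might be reduced to a proliferation value, the sketch below subtracts the 650 nm reading from the 490 nm reading for each well and expresses the mean of triplicate treated wells as a percentage of the mean of the DMSO control wells. Treating 650 nm as a reference wavelength to be subtracted is an assumption, as are the function names and every absorbance value shown; none of this is taken from the results in FIGS. 4A-4C.

```python
from statistics import mean


def corrected_absorbance(a490, a650):
    """Signal at 490 nm minus the reading at 650 nm (assumed reference)."""
    return a490 - a650


def percent_of_control(treated_wells, control_wells):
    """Mean corrected absorbance of treated wells as a % of control wells."""
    treated = mean(corrected_absorbance(a, r) for a, r in treated_wells)
    control = mean(corrected_absorbance(a, r) for a, r in control_wells)
    return treated / control * 100.0


# Hypothetical triplicate readings as (A490, A650) pairs; not measured data.
control_wells = [(1.10, 0.10), (1.00, 0.10), (1.20, 0.10)]
treated_wells = [(1.60, 0.10), (1.70, 0.10), (1.50, 0.10)]

result = percent_of_control(treated_wells, control_wells)
print(f"{result:.1f}% of DMSO control")
```

A value above 100% would correspond to increased proliferation relative to the vehicle control.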

Overall, the DPRPs are expressed in a wide variety of tissues, as has been
demonstrated by mRNA amplification, western blotting and immunohistochemistry.
DPRP-1 was most abundant in the testis by Northern blot and western blot. The large
number of expressed sequence tags (ESTs) from testis cDNA sources that are
homologous to DPRP-1 also confirms abundant expression of DPRP-1 in testis.
Example 4 describes the immunohistochemical localization of DPRP-1 protein in human
testis using a specific DPRP-1 antibody. DPRP-1 is strongly expressed in epithelioid
Leydig cells, and Leydig cells are the primary source of testicular androgens (male
steroid hormones) in the mammalian male. In the interstitium of the testis, Leydig cells
and macrophages are in close association, with "digitation" of Leydig cell processes
extending onto the macrophage surface. Multinucleated cells in close proximity to the
Leydig cells were also stained with DPRP-1 antibody, suggesting that the protease was
also expressed in macrophages, and macrophages in the testis play an important role in
the paracrine regulation of Leydig cells. Cytokines secreted by the testicular
macrophages are mitogenic to Leydig cells and play an important role in the
differentiation of mesenchymal progenitor cells into mature Leydig cells. A clearer
understanding of the proteins and pathways involved in the maturation of the testis is
important for the discovery of new treatments for precocious puberty. In addition,
Leydig cells cause tumors such as sex cord-stromal tumors via sexual steroid production
(predominantly testosterone). Testosterone is associated with several neoplasias and
diseases such as breast carcinoma and uterine cancers, ovarian carcinoma and
androgenic alopecia (hair loss). The localization of DPRP
proteins in other glands in the body (e.g. adrenal glands) that produce testosterone and
other androgenic hormones is currently under investigation. The possible association
of DPRP-1 with steroid and polypeptide hormone biosynthetic pathway functions is
being investigated, and Example 7 is relevant to understanding the role of DPRP proteins
in prostate, testis and breast in vitro cell models.
Immunohistochemical analysis also localized DPRP-1 to endometrial glands in
the uterus (see Example 4), pancreatic acini, glomeruli of the kidney, plasma cells in the
bladder, a subset of B-cells in the tonsils, columnar epithelial cells of the prostate and
poorly differentiated prostate squamous metaplasia, Gleason grade 4 prostatic
carcinoma, and hyperplastic glands in benign prostatic hyperplasia. Positive staining in
breast carcinoma, as well as in seminoma and prostate squamous metaplasia, suggests a
general association of DPRP-1 with hormone-sensitive tissues, particularly in cells that
become poorly differentiated. The presence of DPRP-1 in specialized epithelial cells
and in inflammatory plasma cells (lymphocytes) is also of interest. Inflammatory breast
carcinoma has an abundance of infiltrating lymphocytes and an overall bad prognosis.
DPRP-1 and other DPRP proteins appear in medullary carcinomas that typically have a
constant infiltrating lymphoplasmacytic component at the periphery of the tumor, which
is thought to represent a reaction of the host tissues to the neoplasm. Most of the
lymphocytes are T cells, and most of the plasma cells are of the IgG-producing type.
Several antigens are abundant on B cells, a subgroup of breast-cancer cells, and other
epithelial cancer cells, and these antigens are targets for a new class of therapeutic
monoclonal antibodies, with some notable success having been achieved with a
humanized monoclonal antibody against the B-cell-specific antigen CD20.
Accordingly, monoclonal antibodies to DPRP proteins are felt to be useful to diagnose
and treat diseases in which they are involved, including cancer.

The expression of DPRP-1 in specialized epithelial cells of a number of tissues
suggests that DPRP-1 and other DPRP proteins may be involved in growth and
differentiation thereof. Testing using inhibitors described in Example 6 in in vitro
models of prostate and testis cancer (Example 7) showed that DPRP-1/DPRP-2
inhibitors caused a 50-60% increase in proliferation of PC3 cells at nM concentrations, as
shown in FIGS. 4A-4C.

Although the invention has been described in accordance with its preferred 
embodiments, which constitute the best mode presently known to the inventors, it should 
be understood that changes and modifications as would be obvious to those skilled in 
this art may be made without departing from its scope which is set forth in the claims
appended hereto. For example, although the disclosure focuses on DPRP-1 and DPRP-2 
in certain instances, DPRP-3 and its fragments are considered to be similarly useful, as 
are nucleic acids encoding same. Particular features of the invention are emphasized in 
the claims that follow. 
CLAIMS:

1. Isolated nucleic acid

which encodes (a) a polypeptide, which includes the amino acid sequence of 
one of SEQ ID NOS:1, 3 and 5, or (b) a polypeptide having an amino acid sequence that is
at least about 70% similar thereto and exhibits the same biological function; 

or which is an alternative splice variant of one of SEQ ID NOS:2, 4 and 6; or 

which is a probe comprising at least 14 contiguous nucleotides from said 
nucleic acid encoding (a) or (b) ; or 

which is complementary to any one of the foregoing. 

2. The isolated nucleic acid of claim 1 which is DNA or RNA. 

3. The isolated nucleic acid of claim 1 which is a DNA transcript that includes 
the entire length of any one of SEQ ID NOS:2, 4 and 6 or which is complementary to the
entire coding region of one of SEQ ID NOS:2, 4 and 6. 

4. An antisense oligonucleotide directed against the DNA of claim 3. 

5. The isolated nucleic acid of claim 1 which is an RNA transcript which 
includes the entire length of any one of SEQ ID NOS:2, 4 and 6. 

6. The isolated nucleic acid of claim 1 which is an alternative splice variant of 
one of SEQ ID NOS:2, 4 and 6. 

7. A polypeptide encoded by the nucleic acid of claim 6. 

8. The isolated nucleic acid of claim 1 which encodes a polypeptide having an
amino acid that is at least about 90% similar to one of SEQ ID NOS:1, 3 and 5.

9. The isolated nucleic acid of claim 1 which encodes a polypeptide having an
amino acid that is at least about 95% similar to one of SEQ ID NOS:1, 3 and 5.

10. The isolated nucleic acid of claim 1 which encodes a polypeptide that has at
least about 90% identity with one of SEQ ID NOS:1, 3 and 5.

11. A nucleic acid probe according to claim 1 comprising at least 14 contiguous 
nucleotides from one of SEQ ID NOS:2, 4 and 6. 

12. An isolated recombinant polynucleotide molecule comprising nucleic acid 
according to claim 1 plus expression-controlling elements linked operably with said nucleic 
acid to drive expression thereof. 

13. An expression vector comprising the nucleic acid of claim 1 encoding a 
polypeptide having the entire amino acid sequence set forth in any one of SEQ ID NOS:1, 3
and 5 operably linked to a promoter, said expression vector being present in a 
compatible host cell. 
14. A mammalian, insect or bacterial host cell that has been genetically
engineered by the insertion of nucleic acid according to claim 1 which codes for at least the 
mature protein portion of the amino acid sequence of SEQ ID NO: 1, 3 or 5. 

15. A process for producing a polypeptide which includes the mature protein 
portion of one of SEQ ID NOS:1, 3 and 5, which process comprises culturing the host cell
of claim 11 under conditions sufficient for the production of said polypeptide.

16. The process of claim 15 wherein said polypeptide is expressed at the 
surface of said cell and further includes the step of recovering the polypeptide or a fragment 
thereof from the culture. 

17. A polypeptide 

which may be optionally glycosylated, and 

which (a) has the amino acid sequence of a mature protein set forth in any one
of SEQ ID NOS:1, 3 and 5; (b) has the amino acid sequence of a mature protein having at
least about 70% similarity to one of the mature proteins of (a) and which exhibits the same 
biological function; (c) has the amino acid sequence of a mature protein having at least 
about 90% identity with a mature protein of any of SEQ ID NOS: 1, 3 and 5; or (d) is an 
immunologically reactive fragment of (a). 

18. The polypeptide according to claim 14 which is a mature protein having at
least about 95% similarity to a mature protein of (a). 

19. The polypeptide according to claim 14 which is a mature protein having at 
least about 95% similarity to a mature protein of (a). 

20. The polypeptide according to claim 14 having the amino acid sequence of the 
mature protein of one of SEQ ID NOS:1, 3 and 5, or is a fragment thereof which exhibits
the same biological function as the respective mature protein.

21. A DPRP antagonist which inhibits the biological function of one of said 
mature proteins of claim 17, 18 and 19. 

22. An antibody that recognizes a polypeptide or a fragment according to 
claim 17. 

23. The antibody of claim 22 which recognizes a polypeptide having an amino 
acid sequence of SEQ ID NO: 1 or 3 or 5. 

24. A method for the screening for a compound capable of inhibiting the 
enzymatic activity of at least one mature protein of claim 17, which method comprises
incubating said mature protein and a suitable substrate for said mature protein in the
presence of one or more test compounds or salts thereof, measuring the enzymatic activity
of said mature protein, comparing said activity with comparable activity determined in the
absence of a test compound, and selecting the test compound or compounds that reduce the
enzymatic activity. 



25. A method for the screening for a compound capable of inhibiting the 
enzymatic activity of DPPIV that does not inhibit the enzymatic activity of at least one of 
the mature proteins of claim 20, which method comprises incubating said mature protein
and a suitable substrate for said mature protein in the presence of one or more inhibitors of 
DPPIV or salts thereof, measuring the enzymatic activity of said mature protein, comparing 
said activity with comparable activity determined in the absence of the DPPIV inhibitor, 
and selecting a compound that does not reduce the enzymatic activity of said mature 
protein. 
FIG. 4B: x-axis, [Inhibitor 65] fold IC50 (0.0, 0.1, 1.0, 10.0, 100.0)

FIG. 4C: x-axis, [Inhibitor 66] fold IC50 (0.0, 0.1, 1.0, 10.0, 100.0)
Sequence Listing Summary

SEQ ID

1. DPRP1 a.a. sequence
2. DPRP1 DNA sequence
3. DPRP2 a.a. sequence
4. DPRP2 DNA sequence
5. DPRP-3 a.a. sequence
6. DPRP-3 DNA sequence
7. DPRP-1 transcript 0 a.a. sequence
8. DPRP-1 transcript 0 DNA sequence
9. DPRP-1 transcript 1 a.a. sequence
10. DPRP-1 transcript 1 DNA sequence
11. DPRP-1 transcript 2 a.a. sequence
12. DPRP-1 transcript 2 DNA sequence
13. DPRP-1 transcript 3 a.a. sequence
14. DPRP-1 transcript 3 DNA sequence
15. DPRP-1 transcript 4 a.a. sequence
16. DPRP-1 transcript 4 DNA sequence
17. DPRP-1 transcript 5 a.a. sequence
18. DPRP-1 transcript 5 DNA sequence
19. DPRP-1 transcript 6 a.a. sequence
20. DPRP-1 transcript 6 DNA sequence
21. DPRP-1 transcript 7 a.a. sequence
22. DPRP-1 transcript 7 DNA sequence
23. DPRP-2 transcript 0 a.a. sequence
24. DPRP-2 transcript 0 DNA sequence
25. DPRP-2 transcript 1 a.a. sequence
26. DPRP-2 transcript 1 DNA sequence
27. DPRP-2 transcript 2 a.a. sequence
28. DPRP-2 transcript 2 DNA sequence
29. DPRP-2 transcript 3 a.a. sequence
30. DPRP-2 transcript 3 DNA sequence
31. DPRP-2 transcript 4 a.a. sequence
32. DPRP-2 transcript 4 DNA sequence
33. DPRP-2 transcript 5 a.a. sequence
34. DPRP-2 transcript 5 DNA sequence
35. DPRP-2 transcript 6 a.a. sequence
36. DPRP-2 transcript 6 DNA sequence
37. DPRP-2 transcript 7 a.a. sequence
38. DPRP-2 transcript 7 DNA sequence
39. DPRP-2 transcript 8 a.a. sequence
40. DPRP-2 transcript 8 DNA sequence
41. DPRP-3 transcript 0 a.a. sequence
42. DPRP-3 transcript 0 DNA sequence
43. DPRP-3 transcript 1 a.a. sequence
44. DPRP-3 transcript 1 DNA sequence
45. DPRP1 forward primer used for cloning
46. DPRP1 reverse primer used for cloning full length gene
47. DPRP1 reverse primer used for cloning fusion gene
48. DPRP1 forward primer used for expression profiling
49. DPRP1 reverse primer used for expression profiling
50. DPRP2 forward primer used for cloning
51. DPRP2 reverse primer used for cloning full length gene
52. DPRP2 reverse primer used for cloning fusion gene
53. DPRP2 forward primer used for expression profiling
54. DPRP2 reverse primer used for expression profiling
55. DPRP3 forward primer used for cloning
56. DPRP3 reverse primer used for cloning full length gene
57. DPRP3 forward primer used for cloning fusion gene
58. DPRP3 reverse primer used for cloning fusion gene
59. DPRP1 peptide antigen sequences
60. DPRP2 peptide antigen sequences
61. DPRP3 peptide antigen sequences

<110> Qi, Steve
      Akinsanya, Karen
      Riviere, Pierre
      Junien, Jean-Louis

<120> NOVEL SERINE PROTEASE GENES RELATED TO DPPIV

<130> 70669

<150> US 60/240,117

<151> 2000-10-12

<160> 61

<170> PatentIn version 3.1



<210> 1 

<211> 882 

<212> PRT 

<213> Homo sapiens 

<400> 1 



Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu
1               5                   10                  15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu
            20                  25                  30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu
        35                  40                  45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro
    50                  55                  60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser
65                  70                  75                  80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr
                85                  90                  95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu
            100                 105                 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp
        115                 120                 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg
    130                 135                 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly
145                 150                 155                 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly
                165                 170                 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu
            180                 185                 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp
        195                 200                 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn
    210                 215                 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu
225                 230                 235                 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val
                245                 250                 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys
            260                 265                 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu
        275                 280                 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met
    290                 295                 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr
305                 310                 315                 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala
                325                 330                 335

Glu Gly Arg Ile Ile Asp Val Ile Asp Lys Glu Leu Ile Gln Pro Phe
            340                 345                 350

Glu Ile Leu Phe Glu Gly Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr
        355                 360                 365

Pro Glu Gly Lys Tyr Ala Trp Ser Ile Leu Leu Asp Arg Ser Gln Thr
    370                 375                 380

Arg Leu Gln Ile Val Leu Ile Ser Pro Glu Leu Phe Ile Pro Val Glu
385                 390                 395                 400

Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro Asp Ser
                405                 410                 415

Val Thr Pro Leu Ile Ile Tyr Glu Glu Thr Thr Asp Ile Trp Ile Asn
            420                 425                 430

Ile His Asp Ile Phe His Val Phe Pro Gln Ser His Glu Glu Glu Ile
        435                 440                 445

Glu Phe Ile Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr
    450                 455                 460

Lys Ile Thr Ser Ile Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly
465                 470                 475                 480

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro Ile Lys Glu Glu Ile
                485                 490                 495

Ala Ile Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn
            500                 505                 510

Ile Gln Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys
        515                 520                 525

Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro
    530                 535                 540

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys
545                 550                 555                 560

Ile Ser Gln His Cys Asp Phe Phe Ile Ser Lys Tyr Ser Asn Gln Lys
                565                 570                 575

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp
            580                 585                 590

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Ala
        595                 600                 605

Gly Pro Leu Pro Asp Tyr Thr Pro Pro Glu Ile Phe Ser Phe Glu Ser
    610                 615                 620

Thr Thr Gly Phe Thr Leu Tyr Gly Met Leu Tyr Lys Pro His Asp Leu
625                 630                 635                 640

Gln Pro Gly Lys Lys Tyr Pro Thr Val Leu Phe Ile Tyr Gly Gly Pro
                645                 650                 655

Gln Val Gln Leu Val Asn Asn Arg Phe Lys Gly Val Lys Tyr Phe Arg
            660                 665                 670

Leu Asn Thr Leu Ala Ser Leu Gly Tyr Val Val Val Val Ile Asp Asn
        675                 680                 685

Arg Gly Ser Cys His Arg Gly Leu Lys Phe Glu Gly Ala Phe Lys Tyr
    690                 695                 700

Lys Met Gly Gln Ile Glu Ile Asp Asp Gln Val Glu Gly Leu Gln Tyr
705                 710                 715                 720

Leu Ala Ser Arg Tyr Asp Phe Ile Asp Leu Asp Arg Val Gly Ile His
                725                 730                 735

Gly Trp Ser Tyr Gly Gly Tyr Leu Ser Leu Met Ala Leu Met Gln Arg
            740                 745                 750

Ser Asp Ile Phe Arg Val Ala Ile Ala Gly Ala Pro Val Thr Leu Trp
        755                 760                 765

Ile Phe Tyr Asp Thr Gly Tyr Thr Glu Arg Tyr Met Gly His Pro Asp
    770                 775                 780

Gln Asn Glu Gln Gly Tyr Tyr Leu Gly Ser Val Ala Met Gln Ala Glu
785                 790                 795                 800

Lys Phe Pro Ser Glu Pro Asn Arg Leu Leu Leu Leu His Gly Phe Leu
                805                 810                 815

Asp Glu Asn Val His Phe Ala His Thr Ser Ile Leu Leu Ser Phe Leu
            820                 825                 830

Val Arg Ala Gly Lys Pro Tyr Asp Leu Gln Ile Tyr Pro Gln Glu Arg
        835                 840                 845

His Ser Ile Arg Val Pro Glu Ser Gly Glu His Tyr Glu Leu His Leu
    850                 855                 860

Leu His Tyr Leu Gln Glu Asn Leu Gly Ser Arg Ile Ala Ala Leu Lys
865                 870                 875                 880

Val Ile

<210> 2 

<211> 2671 

<212> DNA 

<213> Homo sapiens 

<400> 2 



cggtaccatg gcagcagcaa tggaaacaga acagctgggt gttgagatat ttgaaactgc 60 

ggactgtgag gagaatattg aatcacagga tcggcctaaa ttggagcctt tttatgttga 120 

gcggtattcc tggagtcagc ttaaaaagct gcttgccgat accagaaaat atcatggcta 180 

catgatggct aaggcaccac atgatttcat gtttgtgaag aggaatgatc cagatggacc 240 

tcattcagac agaatctatt accttgccat gtctggtgag aacagagaaa atacactgtt 300

ttattctgaa attcccaaaa ctatcaatag agcagcagtc ttaatgctct cttggaagcc 360 

tcttttggat ctttttcagg caacactgga ctatggaatg tattctcgag aagaagaact 420 

attaagagaa agaaaacgca ttggaacagt cggaattgct tcttacgatt atcaccaagg 480 

aagtggaaca tttctgtttc aagccggtag tggaatttat cacgtaaaag atggagggcc 540 

acaaggattt acgcaacaac ctttaaggcc caatctagtg gaaactagtt gtcccaacat 600 

acggatggat ccaaaattat gccctgctga tccagactgg attgctttta tacatagcaa 660 

cgatatttgg atatctaaca tcgtaaccag agaagaaagg agactcactt atgtgcacaa 720

tgagctagcc aacatggaag aagatgccag atcagctgga gtcgctacct ttgttctcca 780 

agaagaattt gatagatatt ctggctattg gtggtgtcca aaagctgaaa caactcccag 840 

tggtggtaaa attcttagaa ttctatatga agaaaatgat gaatctgagg tggaaattat 900 

tcatgttaca tcccctatgt tggaaacaag gagggcagat tcattccgtt atcctaaaac 960 

aggtacagca aatcctaaag tcacttttaa gatgtcagaa ataatgattg atgctgaagg 1020 

aaggatcata gatgtcatag ataaggaact aattcaacct tttgagattc tatttgaagg 1080 

agttgaatat attgccagag ctggatggac tcctgaggga aaatatgctt ggtccatcct 1140 

actagatcgc tcccagactc gcctgcagat agtgttgatc tcacctgaat tatttatccc 1200 

agtagaagat gatgttatgg aaaggcagag actcattgag tcagtgcctg attctgtgac 1260 

gccactaatt atctatgaag aaacaacaga catctggata aatatccatg acatctttca 1320 

tgtttttccc caaagtcacg aagaggaaat tgagtttatt tttgcctctg aatgcaaaac 1380 

aggtttccgt catttataca aaattacatc tattttaaag gaaagcaaat ataaacgatc 1440 

cagtggtggg ctgcctgctc caagtgattt caagtgtcct atcaaagagg agatagcaat 1500 

taccagtggt gaatgggaag ttcttggccg gcatggatct aatatccaag ttgatgaagt 1560 

cagaaggctg gtatattttg aaggcaccaa agactcccct ttagagcatc acctgtacgt 1620

agtcagttac gtaaatcctg gagaggtgac aaggctgact gaccgtggct actcacattc 1680

ttgctgcatc agtcagcact gtgacttctt tataagtaag tatagtaacc agaagaatcc 1740

acactgtgtg tccctttaca agctatcaag tcctgaagat gacccaactt gcaaaacaaa 1800

ggaattttgg gccaccattt tggattcagc aggtcctctt cctgactata ctcctccaga 1860 

aattttctct tttgaaagta ctactggatt tacattgtat gggatgctct acaagcctca 1920 
tgatctacag cctggaaaga aatatcctac tgtgctgttc atatatggtg gtcctcaggt 1980

gcagttggtg aataatcgat ttaaaggagt caagtatttc cgcttgaata ccctagcctc 2040

tctaggttat gtggttgtag tgatagacaa caggggatcc tgtcaccgag ggcttaaatt 2100

tgaaggcgcc tttaaatata aaatgggtca aatagaaatt gacgatcagg tggaaggact 2160

ccaatatcta gcttctcgat atgatttcat tgacttagat cgtgtgggca tccacggctg 2220

gtcctatgga ggatacctct ccctgatggc attaatgcag aggtcagata tcttcagggt 2280

tgctattgct ggggccccag tcactctgtg gatcttctat gatacaggat acacggaacg 2340

ttatatgggt caccctgacc agaatgaaca gggctattac ttaggatctg tggccatgca 2400

agcagaaaag ttcccctctg aaccaaatcg tttactgctc ttacatggtt tcctggatga 2460

gaatgtccat tttgcacata ccagtatatt actgagtttt ttagtgaggg ctggaaagcc 2520

atatgattta cagatctatc ctcaggagag acacagcata agagttcctg aatcgggaga 2580

acattatgaa ctgcatcttt tgcactacct tcaagaaaac cttggatcac gtattgctgc 2640

tctaaaagtg atatgagcgg ccgcgagctc c 2671

<210> 3 

<211> 863 

<212> PRT 

<213> Homo sapiens

<400> 3



Met 


Ala 


Thr 


Thr 


Glv 


Thr 


Pro 


Thr 


Ala 


Asp 


Arg 


Gly 


Asp 


Ala 


Ala 


Ala 


1 








5 










10 










15 




Tlix 


Asp 


Asp 


Pro 


Ala 


Ala 


Arq 


Phe 


Gin 


Val 


Gin 


Lys 


His 


Ser 


Trp 


Asp 








20 










25 










30 






Gly 


Leu 


Arg 


Ser 


He 


He 


His 


Gly 


Ser 


Arg 


Lys 


Tyr 


Ser 


Gly 


Leu 


He 






35 










40 










45 








Val 


Asn 


Lys 


Ala 


Pro 


His 


Asp 


Phe 


Gin 


Phe 


Val 


Gin 


Lys 


Thr 


Asp 


Glu 




50 










55 










60 










Ser 


Gly 


Pro 


His 


Ser 


His 


Arg 


Leu 


Tyr 


Tyr 


Leu 


Gly 


Met 


Pro 


Tyr 


Gly 


65 










70 










75 










80 


Ser 


Arg 


Glu 


Asn 


Ser 


Leu 


Leu 


Tyr 


Ser 


Glu 


He 


Pro 


Lys 


Lys 


Val 


Arg 










85 










90 










95 




Lys 


Glu 


Ala 


Leu 


Leu 


Leu 


Leu 


Ser 


Trp 


Lys 


Gin 


Met 


Leu 


Asp 


His 


Phe 






100 










105 










110 






Gln Ala Thr Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu
115 120 125

Arg Glu Arg Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe
130 135 140

His Ser Glu Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe
145 150 155 160

His Cys Arg Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys
165 170 175

Pro Leu Glu Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys
180 185 190

Ile Cys Pro Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp
195 200 205

Leu Trp Val Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe
210 215 220

Cys His Gln Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly
225 230 235 240

Val Ala Thr Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr
245 250 255

Trp Trp Cys Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr
260 265 270

Leu Arg Ile Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile
275 280 285

His Val Pro Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg
290 295 300

Tyr Pro Arg Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala
305 310 315 320



Glu Phe Gln Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys
325 330 335

Glu Leu Val Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile
340 345 350

Ala Arg Ala Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe
355 360 365

Leu Asp Arg Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala
370 375 380

Leu Phe Ile Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala
385 390 395 400

Arg Ala Val Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val
405 410 415

Thr Asn Val Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln
420 425 430

Ser Glu Gly Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys
435 440 445

Thr Gly Phe Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln
450 455 460

Gly Tyr Asp Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys
465 470 475 480

Cys Pro Ile Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val
485 490 495

Leu Ala Arg His Gly Ser Lys Ile Trp Val Asn Glu Glu Thr Lys Leu
500 505 510

Val Tyr Phe Gln Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr
515 520 525

Val Val Ser Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro
530 535 540

Gly Phe Ser His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val
545 550 555 560

Ser His Tyr Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys
565 570 575

Leu Ser Gly Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp
580 585 590

Ala Ser Met Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro
595 600 605

Glu Ile Phe His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met
610 615 620

Ile Tyr Lys Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val
625 630 635 640

Leu Phe Val Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe
645 650 655

Lys Gly Ile Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr
660 665 670

Ala Val Val Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg
675 680 685

Phe Glu Gly Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp
690 695 700

Gln Val Glu Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp
705 710 715 720

Leu Ser Arg Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser
725 730 735

Leu Met Gly Leu Ile His Lys Pro Gln Val Phe Lys Val Ala Ile Ala
740 745 750

Gly Ala Pro Val Thr Val Trp Met Ala Tyr Asp Thr Gly Tyr Thr Glu
755 760 765



WO 02/31134

PCT/US01/31874



Arg Tyr Met Asp Val Pro Glu Asn Asn Gln His Gly Tyr Glu Ala Gly
770 775 780

Ser Val Ala Leu His Val Glu Lys Leu Pro Asn Glu Pro Asn Arg Leu
785 790 795 800

Leu Ile Leu His Gly Phe Leu Asp Glu Asn Val His Phe Phe His Thr
805 810 815

Asn Phe Leu Val Ser Gln Leu Ile Arg Ala Gly Lys Pro Tyr Gln Leu
820 825 830

Gln Ile Tyr Pro Asn Glu Arg His Ser Ile Arg Cys Pro Glu Ser Gly
835 840 845

Glu His Tyr Glu Val Thr Leu Leu His Phe Leu Gln Glu Tyr Leu
850 855 860



<210> 4 

<211> 2617 

<212> DNA 

<213> Homo sapiens 

<400> 4 

caagcttacc atggccacca ccgggacccc aacggccgac cgaggcgacg cagccgccac 60 

agatgacccg gccgcccgct tccaggtgca gaagcactcg tgggacgggc tccggagcat 120 

catccacggc agccgcaagt actcgggcct cattgtcaac aaggcgcccc acgacttcca 180 

gttcgtgcag aagacggatg agtctgggcc ccactcccac cgcctctact acctgggaat 240 

gccatatggc agccgagaga actccctcct ctactctgag attcccaaga aggtccggaa 300 

agaggctctg ctgctcctgt cctggaagca gatgctggat catttccagg ccacgcccca 360 

ccatggggtc tactctcggg aggaggagct gctgagggag cggaaacgcc tgggggtctt 420 

cggcatcacc tcctacgact tccacagcga gagtggcctc ttcctcttcc aggccagcaa 480 

cagcctcttc cactgtcgcg acggcggcaa gaacggcttc atggtgtccc ctatgaaacc 540 

gctggaaatc aagacccagt gctcagggcc ccggatggac cccaaaatct gccctgccga 600 

ccctgccttc ttctccttca tcaataacag cgacctgtgg gtggccaaca tcgagacagg 660 

cgaggagcgg cggctgacct tctgccacca aggtttatcc aatgtcctgg atgaccccaa 720 

gtctgcgggt gtggccacct tcgtcataca ggaagagttc gaccgcttca ctgggtactg 780 

gtggtgcccc acagcctcct gggaaggttc agagggcctc aagacgctgc gaatcctgta 840 

tgaggaagtc gatgagtccg aggtggaggt cattcacgtc ccctctcctg cgctagaaga 900 

aaggaagacg gactcgtatc ggtaccccag gacaggcagc aagaatccca agattgcctt 960 

gaaactggct gagttccaga ctgacagcca gggcaagatc gtctcgaccc aggagaagga 1020 

gctggtgcag cccttcagct cgctgttccc gaaggtggag tacatcgcca gggccgggtg 1080 

gacccgggat ggcaaatacg cctgggccat gttcctggac cggccccagc agtggctcca 1140 

gctcgtcctc ctccccccgg ccctgttcat cccgagcaca gagaatgagg agcagcggct 1200 

agcctctgcc agagctgtcc ccaggaatgt ccagccgtat gtggtgtacg aggaggtcac 1260 

caacgtctgg atcaatgttc atgacatctt ctatcccttc ccccaatcag agggagagga 1320 

cgagctctgc tttctccgcg ccaatgaatg caagaccggc ttctgccatt tgtacaaagt 1380 

caccgccgtt ttaaaatccc agggctacga ttggagtgag cccttcagcc ccggggaaga 1440 

tgaatttaag tgccccatta aggaagagat tgctctgacc agcggtgaat gggaggtttt 1500 

ggcgaggcac ggctccaaga tctgggtcaa tgaggagacc aagctggtgt acttccaggg 1560 

caccaaggac acgccgctgg agcaccacct ctacgtggtc agctatgagg cggccggcga 1620 

gatcgtacgc ctcaccacgc ccggcttctc ccatagctgc tccatgagcc agaacttcga 1680 

catgttcgtc agccactaca gcagcgtgag cacgccgccc tgcgtgcacg tctacaagct 1740 

gagcggcccc gacgacgacc ccctgcacaa gcagccccgc ttctgggcta gcatgatgga 1800 

ggcagccagc tgccccccgg attatgttcc tccagagatc ttccatttcc acacgcgctc 1860 

ggatgtgcgg ctctacggca tgatctacaa gccccacgcc ttgcagccag ggaagaagca 1920 

ccccaccgtc ctctttgtat atggaggccc ccaggtgcag ctggtgaata actccttcaa 1980 

aggcatcaag tacttgcggc tcaacacact ggcctccctg ggctacgccg tggttgtgat 2040 

tgacggcagg ggctcctgtc agcgagggct tcggttcgaa ggggccctga aaaaccaaat 2100 

gggccaggtg gagatcgagg accaggtgga gggcctgcag ttcgtggccg agaagtatgg 2160 

cttcatcgac ctgagccgag ttgccatcca tggctggtcc tacgggggct tcctctcgct 2220 

catggggcta atccacaagc cccaggtgtt caaggtggcc atcgcgggtg ccccggtcac 2280 

cgtctggatg gcctacgaca cagggtacac tgagcgctac atggacgtcc ctgagaacaa 2340 

ccagcacggc tatgaggcgg gttccgtggc cctgcacgtg gagaagctgc ccaatgagcc 2400 

caaccgcttg cttatcctcc acggcttcct ggacgaaaac gtgcactttt tccacacaaa 2460 
cttcctcgtc tcccaactga tccgagcagg gaaaccttac cagctccaga tctaccccaa 2520 
cgagagacac agtattcgct gccccgagtc gggcgagcac tatgaagtca cgttgctgca 2580 
ctttctacag gaatacctct gagcggccgc ggatccg 2617 

<210> 5 

<211> 796 

<212> PRT 

<213> Homo sapiens 

<400> 5 



Met Asn Gln Thr Ala Ser Val Ser His His Ile Lys Cys Gln Pro Ser
1 5 10 15

Lys Thr Ile Lys Glu Leu Gly Ser Asn Ser Pro Pro Gln Arg Asn Trp
20 25 30

Lys Gly Ile Ala Ile Ala Leu Leu Val Ile Leu Val Val Cys Ser Leu
35 40 45

Ile Thr Met Ser Val Ile Leu Leu Ser Pro Asp Glu Leu Thr Asn Ser
50 55 60

Ser Glu Thr Arg Leu Ser Leu Glu Asp Leu Phe Arg Lys Asp Phe Val
65 70 75 80

Leu His Asp Pro Glu Ala Arg Trp Ile Asn Asp Thr Asp Val Val Tyr
85 90 95

Lys Ser Glu Asn Gly His Val Ile Lys Leu Asn Ile Glu Thr Asn Ala
100 105 110

Thr Thr Leu Leu Leu Glu Asn Thr Thr Phe Val Thr Phe Lys Ala Ser
115 120 125

Arg His Ser Val Ser Pro Asp Leu Lys Tyr Val Leu Leu Ala Tyr Asp
130 135 140

Val Lys Gln Ile Phe His Tyr Ser Tyr Thr Ala Ser Tyr Val Ile Tyr
145 150 155 160

Asn Ile His Thr Arg Glu Val Trp Glu Leu Asn Pro Pro Glu Val Glu
165 170 175

Asp Ser Val Leu Gln Tyr Ala Ala Trp Gly Val Gln Gly Gln Gln Leu
180 185 190

Ile Tyr Ile Phe Glu Asn Asn Ile Tyr Tyr Gln Pro Asp Ile Lys Ser
195 200 205

Ser Ser Leu Arg Leu Thr Ser Ser Gly Lys Glu Glu Ile Ile Phe Asn
210 215 220

Gly Ile Ala Asp Trp Leu Tyr Glu Glu Glu Leu Leu His Ser His Ile
225 230 235 240

Ala His Trp Trp Ser Pro Asp Gly Glu Arg Leu Ala Phe Leu Met Ile
245 250 255

Asn Asp Ser Leu Val Pro Thr Met Val Ile Pro Arg Phe Thr Gly Ala
260 265 270

Leu Tyr Pro Lys Gly Lys Gln Tyr Pro Tyr Pro Lys Ala Gly Gln Val
275 280 285

Asn Pro Thr Ile Lys Leu Tyr Val Val Asn Leu Tyr Gly Pro Thr His
290 295 300

Thr Leu Glu Leu Met Pro Pro Asp Ser Phe Lys Ser Arg Glu Tyr Tyr
305 310 315 320

Ile Thr Met Val Lys Trp Val Ser Asn Thr Lys Thr Val Val Arg Trp
325 330 335

Leu Asn Arg Pro Gln Asn Ile Ser Ile Leu Thr Val Cys Glu Thr Thr
340 345 350

Thr Gly Ala Cys Ser Lys Lys Tyr Glu Met Thr Ser Asp Thr Trp Leu
355 360 365

Ser Gln Gln Asn Glu Glu Pro Val Phe Ser Arg Asp Gly Ser Lys Phe
370 375 380

Phe Met Thr Val Pro Val Lys Gln Gly Gly Arg Gly Glu Phe His His
385 390 395 400

Ile Ala Met Phe Leu Ile Gln Ser Lys Ser Glu Gln Ile Thr Val Arg
405 410 415

His Leu Thr Ser Gly Asn Trp Glu Val Ile Lys Ile Leu Ala Tyr Asp
420 425 430

Glu Thr Thr Gln Lys Ile Tyr Phe Leu Ser Thr Glu Ser Ser Pro Arg
435 440 445

Gly Arg Gln Leu Tyr Ser Ala Ser Thr Glu Gly Leu Leu Asn Arg Gln
450 455 460

Cys Ile Ser Cys Asn Phe Met Lys Glu Gln Cys Thr Tyr Phe Asp Ala
465 470 475 480

Ser Phe Ser Pro Met Asn Gln His Phe Leu Leu Phe Cys Glu Gly Pro
485 490 495

Arg Val Pro Val Val Ser Leu His Ser Thr Asp Asn Pro Ala Lys Tyr
500 505 510

Phe Ile Leu Glu Ser Asn Ser Met Leu Lys Glu Ala Ile Leu Lys Lys
515 520 525

Lys Ile Gly Lys Pro Glu Ile Lys Ile Leu His Ile Asp Asp Tyr Glu
530 535 540

Leu Pro Leu Gln Leu Ser Leu Pro Lys Asp Phe Met Asp Arg Asn Gln
545 550 555 560

Tyr Ala Leu Leu Leu Ile Met Asp Glu Glu Pro Gly Gly Gln Leu Val
565 570 575

Thr Asp Lys Phe His Ile Asp Trp Asp Ser Val Leu Ile Asp Met Asp
580 585 590

Asn Val Ile Val Ala Arg Phe Asp Gly Arg Gly Ser Gly Phe Gln Gly
595 600 605

Leu Lys Ile Leu Gln Glu Ile His Arg Arg Leu Gly Ser Val Glu Val
610 615 620

Lys Asp Gln Ile Thr Ala Val Lys Phe Leu Leu Lys Leu Pro Tyr Ile
625 630 635 640

Asp Ser Lys Arg Leu Ser Ile Phe Gly Lys Gly Tyr Gly Gly Tyr Ile
645 650 655

Ala Ser Met Ile Leu Lys Ser Asp Glu Lys Leu Phe Lys Cys Gly Ser
660 665 670

Val Val Ala Pro Ile Thr Asp Leu Lys Leu Tyr Ala Ser Ala Phe Ser
675 680 685

Glu Arg Tyr Leu Gly Met Pro Ser Lys Glu Glu Ser Thr Tyr Gln Ala
690 695 700

Ala Ser Val Leu His Asn Val His Gly Leu Lys Glu Glu Asn Ile Leu
705 710 715 720

Ile Ile His Gly Thr Ala Asp Thr Lys Val His Phe Gln His Ser Ala
725 730 735

Glu Leu Ile Lys His Leu Ile Lys Ala Gly Val Asn Tyr Thr Met Gln
740 745 750

Val Tyr Pro Asp Glu Gly His Asn Val Ser Glu Lys Ser Lys Tyr His
755 760 765

Leu Tyr Ser Thr Ile Leu Lys Phe Phe Ser Asp Cys Leu Lys Glu Glu
770 775 780

Ile Ser Val Leu Pro Gln Glu Pro Glu Glu Asp Glu
785 790 795

<210> 6

<211> 2583

<212> DNA

<213> Homo sapiens

<400> 6



,cc.gge.tt .t^cact^tc casggtcctg .aa.atga.c c.aa«.=ca gc,tgtc=ca ^60 
IZll'lll llTslHt'l Sa.f...a. e=tc.c«at .so 



cactatgtca gtcatcctct taagcccaga tgaactcaca aattcgtcag aaaccagatt 240

gtctttggaa gacctcttta ggaaagactt tgtgcttcac gatccagagg ctcggtggat 300

caatgataca gatgtggtgt ataaaagcga gaatggacat gtcattaaac tgaatataga 360

aacaaatgct accacattat tattggaaaa cacaactttt gtaaccttca aagcatcaag 420

acattcagtt tcaccagatt taaaatatgt ccttctggca tatgatgtca aacagatttt 480

tcattattcg tatactgctt catatgtgat ttacaacata cacactaggg aagtttggga 540

gttaaatcct ccagaagtag aggactccgt cttgcagtac gcggcctggg gtgtccaagg 600

gcagcagctg atttatattt ttgaaaataa tatctactat caacctgata taaagagcag 660

ttcattgcga ctgacatctt ctggaaaaga agaaataatt tttaatggga ttgctgactg 720

gttatatgaa gaggaactcc tgcattctca catcgcccac tggtggtcac cagatggaga 780

aagacttgcc ttcctgatga taaatgactc tttggtaccc accatggtta tccctcggtt 840

tactggagcg ttgtatccca aaggaaagca gtatccgtat cctaaggcag gtcaagtgaa 900

cccaacaata aaattatatg ttgtaaacct gtatggacca actcacactt tggagctcat 960 

gccacctgac agctttaaat caagagaata ctatatcact atggttaaat gggtaagcaa 1020 

taccaagact gtggtaagat ggttaaaccg acctcagaac atctccatcc tcacagtctg 1080 

tgagaccact acaggtgctt gtagtaaaaa atatgagatg acatcagata cgtggctctc 1140 

tcagcagaat gaggagcccg tgttttctag agacggcagc aaattcttta tgacagtgcc 1200 

tgttaagcaa gggggacgtg gagaatttca ccacatagct atgttcctca tccagagtaa 1260 

aagtgagcaa attaccgtgc ggcatctgac atcaggaaac tgggaagtga taaagatctt 1320 

ggcatacgat gaaactactc aaaaaattta ctttctgagc actgaatctt ctcccagagg 1380 

aaggcagctg tacagtgctt ctactgaagg attattgaat cgccaatgca tttcatgtaa 1440 

tttcatgaaa gaacaatgta catattttga tgccagtttt agtcccatga atcaacattt 1500 

cttattattc tgtgaaggtc caagggtccc agtggtcagc ctacatagta cggacaaccc 1560 

agcaaaatat tttatattgg aaagcaattc tatgctgaag gaagctatcc tgaagaagaa 1620 

gataggaaag ccagaaatta aaatccttca tattgacgac tatgaacttc ctttacagtt 1680 

gtcccttccc aaagatttta tggaccgaaa ccagtatgct cttctgttaa taatggatga 1740 

agaaccagga ggccagctgg ttacagataa gttccatatt gactgggatt ccgtactcat 1800 

tgacatggat aatgtcattg tagcaagatt tgatggcaga ggaagtggat tccagggtct 1860 

gaaaattttg caggagattc atcgaagatt aggttcagta gaagtaaagg accaaataac 1920 

agctgtgaaa tttttgctga aactgcctta cattgactcc aaaagattaa gcatttttgg 1980 

aaagggttat ggtggctata ttgcatcaat gatcttaaaa tcagatgaaa agctttttaa 2040 

atgtggatcc gtggttgcac ctatcacaga cttgaaattg tatgcctcag ctttctctga 2100 

aagatacctt gggatgccat ctaaggaaga aagcacttac caggcagcca gtgtgctaca 2160 

taatgttcat ggcttgaaag aagaaaatat attaataatt catggaactg ctgacacaaa 2220 

agttcatttc caacactcag cagaattaat caagcaccta ataaaagctg gagtgaatta 2280 

tactatgcag gtctacccag atgaaggtca taacgtatct gagaagagca agtatcatct 2340 

ctacagcaca atcctcaaat tcttcagtga ttgtttgaag gaagaaatat ctgtgctacc 2400 

acaggaacca gaagaagatg aataatggac cgtatttata cagaactgaa gggaatattg 2460 

aggctcaatg aaacctgaca aagagactgt aatattgtag ttgctccaga atgtcaaggg 2520 

cagcttacgg agatgtcact ggagcagcac gctcagagac agtgaactag catttgaata 2580 

cac 2583
Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu
1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu
20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu
35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro
50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser
65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr
85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu
100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp
115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg
130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly
145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly
165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu
180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp
195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn
210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu
225 230 235 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val
245 250 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys
260 265 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu
275 280 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met
290 295 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr
305 310 315 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala
325 330 335

Glu Gly Arg Ile Ile Asp Val Ile Asp Lys Glu Leu Ile Gln Pro Phe
340 345 350

Glu Ile Leu Phe Glu Gly Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr
355 360 365

Pro Glu Gly Lys Tyr Ala Trp Ser Ile Leu Leu Asp Arg Ser Gln Thr
370 375 380

Arg Leu Gln Ile Val Leu Ile Ser Pro Glu Leu Phe Ile Pro Val Glu
385 390 395 400

Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro Asp Ser
405 410 415

Val Thr Pro Leu Ile Ile Tyr Glu Glu Thr Thr Asp Ile Trp Ile Asn
420 425 430

Ile His Asp Ile Phe His Val Phe Pro Gln Ser His Glu Glu Glu Ile
435 440 445

Glu Phe Ile Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr
450 455 460

Lys Ile Thr Ser Ile Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly
465 470 475 480

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro Ile Lys Glu Glu Ile
485 490 495

Ala Ile Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn
500 505 510

Ile Gln Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys
515 520 525

Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro
530 535 540

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys
545 550 555 560

Ile Ser Gln His Cys Asp Phe Phe Ile Ser Lys Tyr Ser Asn Gln Lys
565 570 575

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp
580 585 590

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Ala
595 600 605

Gly Pro Leu Pro Asp Tyr Thr Pro Pro Glu Ile Phe Ser Phe Glu Ser
610 615 620

Thr Thr Gly Phe Thr Leu Tyr Gly Met Leu Tyr Lys Pro His Asp Leu
625 630 635 640

Gln Pro Gly Lys Lys Tyr Pro Thr Val Leu Phe Ile Tyr Gly Gly Arg
645 650 655

Leu Leu Leu Leu Gly Pro Gln Ser Leu Cys Gly Ser Ser Met Ile Gln
660 665 670

Asp Thr Arg Asn Val Ile Trp Val Thr Leu Thr Arg Met Asn Arg Ala
675 680 685

Ile Thr
690



<210> 8 

<211> 4523 

<212> DNA 

<213> Homo sapiens 

<400> 8 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagcaggt 2040

cctcttcctg actatactcc tccagaaatt ttctcttttg aaagtactac tggatttaca 2100

ttgtatggga tgctctacaa gcctcatgat ctacagcctg gaaagaaata tcctactgtg 2160

ctgttcatat atggtggtcg gttgctattg ctggggcccc agtcactctg tggatcttct 2220

atgatacagg atacacggaa cgttatatgg gtcaccctga ccagaatgaa cagggctatt 2280

acttaggatc tgtggccatg caagcagaaa agttcccctc tgaaccaaat cgtttactgc 2340

tcttacatgg tttcctggat gagaatgtcc attttgcaca taccagtata ttactgagtt 2400

ttttagtgag ggctggaaag ccatatgatt tacagatcta tcctcaggag agacacagca 2460

taagagttcc tgaatcggga gaacattatg aactgcatct tttgcactac cttcaagaaa 2520

accttggatc acgtattgct gctctaaaag tgatataatt ttgacctgtg tagaactctc 2580

tggtatacac tggctattta accaaatgag gaggtttaat caacagaaaa cacagaattg 2640

atcatcacat tttgatacct gccatgtaac atctactcct gaaaataaat gtggtgccat 2700

gcaggggtct acggtttgtg gtagtaatct aataccttaa ccccacatgc tcaaaatcaa 2760

atgatacata ttcctgagag acccagcaat accataagaa ttactaaaaa aaaaaaaaaa 2820

aaaaagacat tagcaccatg tattcatact accctatttt cacttttaat agtattataa 2880

acttcatgaa cttaattagt gtatttttac agtatacttt tgagtttgtt aaaatatgat 2940

gatattagtg attggtttgg ttcagttcca gaatctttga ctagttacag atttgatagc 3000

acttaaatgt aattgaatag cttatgcttc attgcttggg catatccagc atgttatgaa 3060

ctaataacta ttaaacttga cttaaccagt cattcattaa taatttttca aggataactt 3120

agtggcctcc taaagacact tgttttggca ctgaccagtt tttagccaat ttaatctgta 3180

tctagtataa ataattctca tttttctttg atgatattaa cagagtgggc ttttcctttt 3240

gcataaaggc tagtaactgt atatgtagca tggatttaat tagtcatgat attgataatt 3300

acaggcagaa aatttttaat caaatgatta gagcttaaat atttgcaggc aagttttttt 3360

ttttccttta agaaaaggaa aaagtacaca ttcactagaa ttcttcagaa aatttagtgg 3420

tgccagtttc catttggtat ttccttatta aaatattcta gaattttaag gagattgaag 3480

ggaatcacag tggggtgggg agacctgggt ttggggaatg acagagagaa gaggtggtga 3540

gggcctgatt aaaaactaag cagaagtagt tttaacaaaa atactcatga aaatgtttgg 3600

aaactgaaat ttaaacaact gtaatattaa ggaaaccaga atcaataaat cactgtcttg 3660

ccagcacagc tacagagtaa catgattcag gggaggaaaa gttccttaga gttactttta 3720

taattctttt tttttttcct cttaggttta gaaatcttac aaatttaaac tttatccttt 3780

taaaattatt tgaacataat ttagatattg taagcttaaa atacaaatgt ttatagataa 3840

cctctttacc ataaactaat ccctggcaag ccatggctct cttttttttt ttggtgttta 3900

aagcctgtaa acagtttttc tgaatgatca tgaacttttc ttggtttagc actaggattt 3960

agctatgaag agagctcata ggctttcagg tgctaattga gatctgccct gttagagtct 4020

tggggtgcta gattggtcac attgacacca gtggcaggga aggcatctat gagtttgatg 4080

ctttttatca cacacttcag tgtttagaaa gttattacca atacttttaa acaacactcc 4140

aagaaaattt gctatatttc tttctcatca ctacagagag agtagatttc cccatagaga 4200

gcacagcctc cattagtaag gttggtgact attggtaaga ggtggacttc attgacacca 4260

agtgggaggt agggaaagcc cagaaatggc aggatgatat ggtggttctg tcgttgggaa 4320

aggtattggg ttttgctgtt tgtatttata ctgtataata gataccacgc tttttcttat 4380

tatctgtata tgtattgctt ttcatgtttg atattttccc atgccaagat ttgtttatat 4440

atattttcaa tgttaaatta aattgatttg ggtaactttc ttccccaaga aagtattttc 4500

ccccttaagt ataaatctga ctg 4523



<210> 9 

<211> 241 

<212> PRT 

<213> Homo sapiens 

<400> 9 



Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu
1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu
20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu
35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro
50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser
65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr
85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu
100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp
115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg
130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly
145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly
165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu
180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp
195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn
210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Gly Lys
225 230 235 240

Ala



<210> 10

<211> 1356

<212> DNA

<213> Homo sapiens

<400> 10

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900

gaaaggagac tcacttatgt gcacaatggt aaggcgtagt tcttcagatt tacttttctg 960

aacagtattt tttgaagtat aatttgctgc ttgcattttg aaattagatt accacgttgg 1020

gtgatcttta tatttgaaat tcaagtcttt aaaattttta aaaaatggag aaaagtacag 1080

aggataactt gtatgtacca catgtataat attcatttta atgttttaat gttcattttc 1140

aaacagtgaa acaaaagaac ctctgacatg attgttcttt tagcttgcta agactgccag 1200

aattttccca aaactgttct tattaaaata aaattttagg ctaggcatgg tggctcatgc 1260

ctgtaatcct agcactctgg gaggctgagg caggcagatt gtttgagccc agaagttcaa 1320

gatcaggatg ggcaacatgg tgacacctcg tttgac 1356

<210> 11

<211> 661

<212> PRT

<213> Homo sapiens

<400> 11

Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu
1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu
20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu
Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro
50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser
65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr
85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu
100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp
115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg
130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly
145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly
165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu
180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp
195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn
210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu
225 230 235 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val
245 250 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys
260 265 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu
275 280 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met
290 295 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr
305 310 315 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala
325 330 335

Glu Gly Arg Ile Ile Asp Val Ile Asp Lys Glu Leu Ile Gln Pro Phe
340 345 350

Glu Ile Leu Phe Glu Gly Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr
355 360 365

Pro Glu Gly Lys Tyr Ala Trp Ser Ile Leu Leu Asp Arg Ser Gln Thr
370 375 380

Arg Leu Gln Ile Val Leu Ile Ser Pro Glu Leu Phe Ile Pro Val Glu
385 390 395 400

Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro Asp Ser
405 410 415

Val Thr Pro Leu Ile Ile Tyr Glu Glu Thr Thr Asp Ile Trp Ile Asn
420 425 430

Ile His Asp Ile Phe His Val Phe Pro Gln Ser His Glu Glu Glu Ile
435 440 445

Glu Phe Ile Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr
450 455 460

Lys Ile Thr Ser Ile Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly
465 470 475 480

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro Ile Lys Glu Glu Ile
485 490 495

Ala Ile Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn
500 505 510

Ile Gln Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys
515 520 525

WO 02/31134



Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro
530 535 540

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys
545 550 555 560

Ile Ser Gln His Cys Asp Phe Phe Ile Ser Lys Tyr Ser Asn Gln Lys
565 570 575

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp
580 585 590

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Ala
595 600 605

Gly Pro Leu Pro Asp Tyr Thr Pro Pro Glu Ile Phe Ser Phe Glu Ser
610 615 620

Thr Thr Gly Phe Thr Leu Tyr Gly Met Leu Tyr Lys Pro His Asp Leu
625 630 635 640

Gln Pro Gly Lys Lys Tyr Pro Thr Val Leu Phe Ile Tyr Gly Gly Leu
645 650 655

Leu Arg Cys Ser Trp
660
660 



<210> 12 

<211> 4829 

<212> DNA 

<213> Homo sapiens 

<400> 12 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagcaggt 2040



PCT/US01/31874

cctcttcctg actatactcc tccagaaatt ttctcttttg aaagtactac tggatttaca 2100

ttgtatggga tgctctacaa gcctcatgat ctacagcctg gaaagaaata tcctactgtg 2160

ctgttcatat atggtggtct cctcaggtgc agttggtgaa taatcggttt aaaggagtca 2220

agtatttccg cttgaatacc ctagcctctc taggttatgt ggttgtagtg atagacaaca 2280

ggggatcctg tcaccgaggg cttaaatttg aaggcgcctt taaatataaa atgggtcaaa 2340

tagaaattga cgatcaggtg gaaggactcc aatatctagc ttctcgatat gatttcattg 2400

acttagatcg tgtgggcatc cacggctggt cctatggagg atacctctcc ctgatggcat 2460

taatgcagag gtcagatatc ttcagggttg ctattgctgg ggccccagtc actctgtgga 2520

tcttctatga tacaggatac acggaacgtt atatgggtca ccctgaccag aatgaacagg 2580

gctattactt aggatctgtg gccatgcaag cagaaaagtt cccctctgaa ccaaatcgtt 2640

tactgctctt acatggtttc ctggatgaga atgtccattt tgcacatacc agtatattac 2700

tgagtttttt agtgagggct ggaaagccat atgatttaca gatctatcct caggagagac 2760

acagcataag agttcctgaa tcgggagaac attatgaact gcatcttttg cactaccttc 2820

aagaaaacct tggatcacgt attgctgctc taaaagtgat ataattttga cctgtgtaga 2880

actctctggt atacactggc tatttaacca aatgaggagg tttaatcaac agaaaacaca 2940

gaattgatca tcacattttg atacctgcca tgtaacatct actcctgaaa ataaatgtgg 3000

tgccatgcag gggtctacgg tttgtggtag taatctaata ccttaacccc acatgctcaa 3060

aatcaaatga tacatattcc tgagagaccc agcaatacca taagaattac taaaaaaaaa 3120

aaaaaaaaaa agacattagc accatgtatt catactaccc tattttcact tttaatagta 3180

ttataaactt catgaactta attagtgtat ttttacagta tacttttgag tttgttaaaa 3240

tatgatgata ttagtgattg gtttggttca gttccagaat ctttgactag ttacagattt 3300

gatagcactt aaatgtaatt gaatagctta tgcttcattg cttgggcata tccagcatgt 3360

tatgaactaa taactattaa acttgactta accagtcatt cattaataat ttttcaagga 3420

taacttagtg gcctcctaaa gacacttgtt ttggcactga ccagttttta gccaatttaa 3480

tctgtatcta gtataaataa ttctcatttt tctttgatga tattaacaga gtgggctttt 3540

ccttttgcat aaaggctagt aactgtatat gtagcatgga tttaattagt catgatattg 3600

ataattacag gcagaaaatt tttaatcaaa tgattagagc ttaaatattt gcaggcaagt 3660

tttttttttt cctttaagaa aaggaaaaag tacacattca ctagaattct tcagaaaatt 3720

tagtggtgcc agtttccatt tggtatttcc ttattaaaat attctagaat tttaaggaga 3780

ttgaagggaa tcacagtggg gtggggagac ctgggtttgg ggaatgacag agagaagagg 3840

tggtgagggc ctgattaaaa actaagcaga agtagtttta acaaaaatac tcatgaaaat 3900

gtttggaaac tgaaatttaa acaactgtaa tattaaggaa accagaatca ataaatcact 3960

gtcttgccag cacagctaca gagtaacatg attcagggga ggaaaagttc cttagagtta 4020

cttttataat tctttttttt tttcctctta ggtttagaaa tcttacaaat ttaaacttta 4080

tccttttaaa attatttgaa cataatttag atattgtaag cttaaaatac aaatgtttat 4140

agataacctc tttaccataa actaatccct ggcaagccat ggctctcttt ttttttttgg 4200

tgtttaaagc ctgtaaacag tttttctgaa tgatcatgaa cttttcttgg tttagcacta 4260

ggatttagct atgaagagag ctcataggct ttcaggtgct aattgagatc tgccctgtta 4320

gagtcttggg gtgctagatt ggtcacattg acaccagtgg cagggaaggc atctatgagt 4380

ttgatgcttt ttatcacaca cttcagtgtt tagaaagtta ttaccaatac ttttaaacaa 4440

cactccaaga aaatttgcta tatttctttc tcatcactac agagagagta gatttcccca 4500

tagagagcac agcctccatt agtaaggttg gtgactattg gtaagaggtg gacttcattg 4560

acaccaagtg ggaggtaggg aaagcccaga aatggcagga tgatatggtg gttctgtcgt 4620

tgggaaaggt attgggtttt gctgtttgta tttatactgt ataatagata ccacgctttt 4680

tcttattatc tgtatatgta ttgcttttca tgtttgatat tttcccatgc caagatttgt 4740

ttatatatat tttcaatgtt aaattaaatt gatttgggta actttcttcc ccaagaaagt 4800

attttccccc ttaagtataa atctgactg 4829



<210> 13 

<211> 358 

<212> PRT 

<213> Homo sapiens 

<400> 13

Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu
1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu
20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu
35 40 45







Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro
50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser
65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr
85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu
100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp
115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg
130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly
145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly
165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu
180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp
195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn
210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu
225 230 235 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val
245 250 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys
260 265 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu
275 280 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met
290 295 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr
305 310 315 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala
325 330 335

Glu Gly Arg Ser Lys Leu Met Lys Ser Glu Gly Trp Tyr Ile Leu Lys
340 345 350

Ala Pro Lys Thr Pro Leu
355

<210> 14 

<211> 4309 

<212> DNA 

<213> Homo sapiens 

<400> 14 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780






ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaaga tccaagttga tgaagtcaga aggctggtat 1260 

attttgaagg caccaaagac tcccctttag agcatcacct gtacgtagtc agttacgtaa 1320 

atcctggaga ggtgacaagg ctgactgacc gtggctactc acattcttgc tgcatcagtc 1380 

agcactgtga cttctttata agtaagtata gtaaccagaa gaatccacac tgtgtgtccc 1440 

tttacaagct atcaagtcct gaagatgacc caacttgcaa aacaaaggaa ttttgggcca 1500 

ccattttgga ttcagcaggt cctcttcctg actatactcc tccagaaatt ttctcttttg 1560 

aaagtactac tggatttaca ttgtatggga tgctctacaa gcctcatgat ctacagcctg 1620 

gaaagaaata tcctactgtg ctgttcatat atggtggtct cctcaggtgc agttggtgaa 1680 

taatcggttt aaaggagtca agtatttccg cttgaatacc ctagcctctc taggttatgt 1740 

ggttgtagtg atagacaaca ggggatcctg tcaccgaggg cttaaatttg aaggcgcctt 1800 

taaatataaa atgggtcaaa tagaaattga cgatcaggtg gaaggactcc aatatctagc 1860 

ttctcgatat gatttcattg acttagatcg tgtgggcatc cacggctggt cctatggagg 1920

atacctctcc ctgatggcat taatgcagag gtcagatatc ttcagggttg ctattgctgg 1980 

ggccccagtc actctgtgga tcttctatga tacaggatac acggaacgtt atatgggtca 2040

ccctgaccag aatgaacagg gctattactt aggatctgtg gccatgcaag cagaaaagtt 2100 

cccctctgaa ccaaatcgtt tactgctctt acatggtttc ctggatgaga atgtccattt 2160 

tgcacatacc agtatattac tgagtttttt agtgagggct ggaaagccat atgatttaca 2220 

gatctatcct caggagagac acagcataag agttcctgaa tcgggagaac attatgaact 2280 

gcatcttttg cactaccttc aagaaaacct tggatcacgt attgctgctc taaaagtgat 2340

ataattttga cctgtgtaga actctctggt atacactggc tatttaacca aatgaggagg 2400 

tttaatcaac agaaaacaca gaattgatca tcacattttg atacctgcca tgtaacatct 2460 

actcctgaaa ataaatgtgg tgccatgcag gggtctacgg tttgtggtag taatctaata 2520

ccttaacccc acatgctcaa aatcaaatga tacatattcc tgagagaccc agcaatacca 2580

taagaattac taaaaaaaaa aaaaaaaaaa agacattagc accatgtatt catactaccc 2640

tattttcact tttaatagta ttataaactt catgaactta attagtgtat ttttacagta 2700 

tacttttgag tttgttaaaa tatgatgata ttagtgattg gtttggttca gttccagaat 2760 

ctttgactag ttacagattt gatagcactt aaatgtaatt gaatagctta tgcttcattg 2820 

cttgggcata tccagcatgt tatgaactaa taactattaa acttgactta accagtcatt 2880

cattaataat ttttcaagga taacttagtg gcctcctaaa gacacttgtt ttggcactga 2940 

ccagttttta gccaatttaa tctgtatcta gtataaataa ttctcatttt tctttgatga 3000 

tattaacaga gtgggctttt ccttttgcat aaaggctagt aactgtatat gtagcatgga 3060 

tttaattagt catgatattg ataattacag gcagaaaatt tttaatcaaa tgattagagc 3120 

ttaaatattt gcaggcaagt tttttttttt cctttaagaa aaggaaaaag tacacattca 3180 

ctagaattct tcagaaaatt tagtggtgcc agtttccatt tggtatttcc ttattaaaat 3240 

attctagaat tttaaggaga ttgaagggaa tcacagtggg gtggggagac ctgggtttgg 3300 

ggaatgacag agagaagagg tggtgagggc ctgattaaaa actaagcaga agtagtttta 3360 

acaaaaatac tcatgaaaat gtttggaaac tgaaatttaa acaactgtaa tattaaggaa 3420

accagaatca ataaatcact gtcttgccag cacagctaca gagtaacatg attcagggga 3480 

ggaaaagttc cttagagtta cttttataat tctttttttt tttcctctta ggtttagaaa 3540 

tcttacaaat ttaaacttta tccttttaaa attatttgaa cataatttag atattgtaag 3600 

cttaaaatac aaatgtttat agataacctc tttaccataa actaatccct ggcaagccat 3660 

ggctctcttt ttttttttgg tgtttaaagc ctgtaaacag tttttctgaa tgatcatgaa 3720 

cttttcttgg tttagcacta ggatttagct atgaagagag ctcataggct ttcaggtgct 3780 

aattgagatc tgccctgtta gagtcttggg gtgctagatt ggtcacattg acaccagtgg 3840

cagggaaggc atctatgagt ttgatgcttt ttatcacaca cttcagtgtt tagaaagtta 3900

ttaccaatac ttttaaacaa cactccaaga aaatttgcta tatttctttc tcatcactac 3960 

agagagagta gatttcccca tagagagcac agcctccatt agtaaggttg gtgactattg 4020 

gtaagaggtg gacttcattg acaccaagtg ggaggtaggg aaagcccaga aatggcagga 4080

tgatatggtg gttctgtcgt tgggaaaggt attgggtttt gctgtttgta tttatactgt 4140 

ataatagata ccacgctttt tcttattatc tgtatatgta ttgcttttca tgtttgatat 4200 

tttcccatgc caagatttgt ttatatatat tttcaatgtt aaattaaatt gatttgggta 4260 

actttcttcc ccaagaaagt attttccccc ttaagtataa atctgactg 4309 



<210> 15

<211> 108 

<212> PRT 

<213> Homo sapiens

<400> 15 

Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu
1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu
20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu
35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro
50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser
65 70 75 80

Asp Arg Ile Tyr Tyr Leu Gly Asn Lys Ser Leu Ile Asp His Asp Arg
85 90 95

Phe Ser Lys Ser Lys Met Pro Glu Ile Ala Ser Ser
100 105

















<210> 16 

<211> 620 

<212> DNA 

<213> Homo sapiens 

<400> 16 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tggtaacaag 480

tcattaattg atcatgatcg tttttcaaaa tcgaagatgc cagaaattgc ttcttcctaa 540

agctagcttg aaatgccttt ctttagatgg tctgattagg aaaacaaaca ataaaaccat 600

tagtttgttc ccactcaaca 620



<210> 17 

<211> 194 

<212> PRT 

<213> Homo sapiens 

<400> 17 
Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu
1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu
20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu
35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro
50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser
65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr
85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu
100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp
115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg
130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly
145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly
165 170 175

Gly Pro Gln Gly Phe Thr Gln Pro Leu Arg Pro Asn Leu Val Glu Thr
180 185 190

Cys Ala



<210> 18 

<211> 832 

<212> DNA 

<213> Homo sapiens 

<400> 18 



aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720

atttatcacg taaaagatgg agggccacaa ggatttacgc wacaaccttt aaggcccaat 780

ctagtggaaa ctasttgtsc caracytgca tgacccaatc agatcctgta ga 832

<210> 19 

<211> 658 

<212> PRT 

<213> Homo sapiens 

<400> 19 



Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu
1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu
20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu
35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro
50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser
65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr
85 90 95

Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu
100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp
115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg
130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly
145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly
165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu
180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp
195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn
210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu
225 230 235 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val
245 250 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys
260 265 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu
275 280 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met
290 295 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr
305 310 315 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala
325 330 335

Glu Gly Arg Ile Ile Asp Val Ile Asp Lys Glu Leu Ile Gln Pro Phe
340 345 350

Glu Ile Leu Phe Glu Gly Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr
355 360 365

Pro Glu Gly Lys Tyr Ala Trp Ser Ile Leu Leu Asp Arg Ser Gln Thr
370 375 380

Arg Leu Gln Ile Val Leu Ile Ser Pro Glu Leu Phe Ile Pro Val Glu
385 390 395 400

Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro Asp Ser
405 410 415

Val Thr Pro Leu Ile Ile Tyr Glu Glu Thr Thr Asp Ile Trp Ile Asn
420 425 430

Ile His Asp Ile Phe His Val Phe Pro Gln Ser His Glu Glu Glu Ile
435 440 445

Glu Phe Ile Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr
450 455 460

Lys Ile Thr Ser Ile Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly
465 470 475 480

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro Ile Lys Glu Glu Ile
485 490 495

Ala Ile Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn
500 505 510

Ile Gln Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys
515 520 525

Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro
530 535 540

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys
545 550 555 560

Ile Ser Gln His Cys Asp Phe Phe Ile Ser Lys Tyr Ser Asn Gln Lys
565 570 575

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp
580 585 590

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Ala
595 600 605

Gly Pro Leu Pro Asp Tyr Thr Pro Pro Glu Ile Phe Ser Phe Glu Ser
610 615 620

Thr Thr Gly Phe Thr Leu Tyr Gly Met Leu Tyr Lys Pro His Asp Leu
625 630 635 640

Gln Pro Gly Lys Lys Tyr Pro Thr Val Leu Phe Ile Tyr Gly Gly Arg
645 650 655

Val Lys

<210> 20 

<211> 4676

<212> DNA 

<213> Homo sapiens 

<400> 20 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180 

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240 

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480 

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540 

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020 

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260 

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320 

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500 

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560 

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620 

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680 

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740 

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800 

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860 

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920 

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980 

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagcaggt 2040 

cctcttcctg actatactcc tccagaaatt ttctcttttg aaagtactac tggatttaca 2100 

ttgtatggga tgctctacaa gcctcatgat ctacagcctg gaaagaaata tcctactgtg 2160 

ctgttcatat atggtggtcg ggtcaaatag aaattgacga tcaggtggaa ggactccaat 2220 
<400> 21

Met Ala Ala Ala Met Glu Thr Glu Gln Leu Gly Val Glu Ile Phe Glu
1 5 10 15

Thr Ala Asp Cys Glu Glu Asn Ile Glu Ser Gln Asp Arg Pro Lys Leu
20 25 30

Glu Pro Phe Tyr Val Glu Arg Tyr Ser Trp Ser Gln Leu Lys Lys Leu
35 40 45

Leu Ala Asp Thr Arg Lys Tyr His Gly Tyr Met Met Ala Lys Ala Pro
50 55 60

His Asp Phe Met Phe Val Lys Arg Asn Asp Pro Asp Gly Pro His Ser
65 70 75 80

Asp Arg Ile Tyr Tyr Leu Ala Met Ser Gly Glu Asn Arg Glu Asn Thr
85 90 95





atctagcttc tcgatatgat ttcattgact tagatcgtgt gggcatccac ggctggtcct 2280 

atggaggata cctctccctg atggcattaa tgcagaggtc agatatcttc agggttgcta 2340 

ttgctggggc cccagtcact ctgtggatct tctatgatac aggatacacg gaacgttata 2400 

tgggtcaccc tgaccagaat gaacagggct attacttagg atctgtggcc atgcaagcag 2460 

aaaagttccc ctctgaacca aatcgtttac tgctcttaca tggtttcctg gatgagaatg 2520 

tccattttgc acataccagt atattactga gttttttagt gagggctgga aagccatatg 2580 

atttacagat ctatcctcag gagagacaca gcataagagt tcctgaatcg ggagaacatt 2640 

atgaactgca tcttttgcac taccttcaag aaaaccttgg atcacgtatt gctgctctaa 2700 

aagtgatata attttgacct gtgtagaact ctctggtata cactggctat ttaaccaaat 2760 

gaggaggttt aatcaacaga aaacacagaa ttgatcatca cattttgata cctgccatgt 2820 

aacatctact cctgaaaata aatgtggtgc catgcagggg tctacggttt gtggtagtaa 2880 

tctaatacct taaccccaca tgctcaaaat caaatgatac atattcctga gagacccagc 2940 

aataccataa gaattactaa aaaaaaaaaa aaaaaaaaga cattagcacc atgtattcat 3000

actaccctat tttcactttt aatagtatta taaacttcat gaacttaatt agtgtatttt 3060 

tacagtatac ttttgagttt gttaaaatat gatgatatta gtgattggtt tggttcagtt 3120 

ccagaatctt tgactagtta cagatttgat agcacttaaa tgtaattgaa tagcttatgc 3180 

ttcattgctt gggcatatcc agcatgttat gaactaataa ctattaaact tgacttaacc 3240 

agtcattcat taataatttt tcaaggataa cttagtggcc tcctaaagac acttgttttg 3300 

gcactgacca gtttttagcc aatttaatct gtatctagta taaataattc tcatttttct 3360 

ttgatgatat taacagagtg ggcttttcct tttgcataaa ggctagtaac tgtatatgta 3420 

gcatggattt aattagtcat gatattgata attacaggca gaaaattttt aatcaaatga 3480 

ttagagctta aatatttgca ggcaagtttt tttttttcct ttaagaaaag gaaaaagtac 3540 

acattcacta gaattcttca gaaaatttag tggtgccagt ttccatttgg tatttcctta 3600 

ttaaaatatt ctagaatttt aaggagattg aagggaatca cagtggggtg gggagacctg 3660 

ggtttgggga atgacagaga gaagaggtgg tgagggcctg attaaaaact aagcagaagt 3720 

agttttaaca aaaatactca tgaaaatgtt tggaaactga aatttaaaca actgtaatat 3780 

taaggaaacc agaatcaata aatcactgtc ttgccagcac agctacagag taacatgatt 3840

caggggagga aaagttcctt agagttactt ttataattct tttttttttt cctcttaggt 3900 

ttagaaatct tacaaattta aactttatcc ttttaaaatt atttgaacat aatttagata 3960 

ttgtaagctt aaaatacaaa tgtttataga taacctcttt accataaact aatccctggc 4020 

aagccatggc tctctttttt tttttggtgt ttaaagcctg taaacagttt ttctgaatga 4080 

tcatgaactt ttcttggttt agcactagga tttagctatg aagagagctc ataggctttc 4140 

aggtgctaat tgagatctgc cctgttagag tcttggggtg ctagattggt cacattgaca 4200 

ccagtggcag ggaaggcatc tatgagtttg atgcttttta tcacacactt cagtgtttag 4260 

aaagttatta ccaatacttt taaacaacac tccaagaaaa tttgctatat ttctttctca 4320 

tcactacaga gagagtagat ttccccatag agagcacagc ctccattagt aaggttggtg 4380 

actattggta agaggtggac ttcattgaca ccaagtggga ggtagggaaa gcccagaaat 4440

ggcaggatga tatggtggtt ctgtcgttgg gaaaggtatt gggttttgct gtttgtattt 4500 

atactgtata atagatacca cgctttttct tattatctgt atatgtattg cttttcatgt 4560 

ttgatatttt cccatgccaa gatttgttta tatatatttt caatgttaaa ttaaattgat 4620 

ttgggtaact ttcttcccca agaaagtatt ttccccctta agtataaatc tgactg 4676 

<210> 21 

<211> 613 

<212> PRT 

<213> Homo sapiens 
Leu Phe Tyr Ser Glu Ile Pro Lys Thr Ile Asn Arg Ala Ala Val Leu
100 105 110

Met Leu Ser Trp Lys Pro Leu Leu Asp Leu Phe Gln Ala Thr Leu Asp
115 120 125

Tyr Gly Met Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg Lys Arg
130 135 140

Ile Gly Thr Val Gly Ile Ala Ser Tyr Asp Tyr His Gln Gly Ser Gly
145 150 155 160

Thr Phe Leu Phe Gln Ala Gly Ser Gly Ile Tyr His Val Lys Asp Gly
165 170 175

Gly Pro Gln Gly Phe Thr Gln Gln Pro Leu Arg Pro Asn Leu Val Glu
180 185 190

Thr Ser Cys Pro Asn Ile Arg Met Asp Pro Lys Leu Cys Pro Ala Asp
195 200 205

Pro Asp Trp Ile Ala Phe Ile His Ser Asn Asp Ile Trp Ile Ser Asn
210 215 220

Ile Val Thr Arg Glu Glu Arg Arg Leu Thr Tyr Val His Asn Glu Leu
225 230 235 240

Ala Asn Met Glu Glu Asp Ala Arg Ser Ala Gly Val Ala Thr Phe Val
245 250 255

Leu Gln Glu Glu Phe Asp Arg Tyr Ser Gly Tyr Trp Trp Cys Pro Lys
260 265 270

Ala Glu Thr Thr Pro Ser Gly Gly Lys Ile Leu Arg Ile Leu Tyr Glu
275 280 285

Glu Asn Asp Glu Ser Glu Val Glu Ile Ile His Val Thr Ser Pro Met
290 295 300

Leu Glu Thr Arg Arg Ala Asp Ser Phe Arg Tyr Pro Lys Thr Gly Thr
305 310 315 320

Ala Asn Pro Lys Val Thr Phe Lys Met Ser Glu Ile Met Ile Asp Ala
325 330 335

Glu Gly Arg Ile Ile Asp Val Ile Asp Lys Glu Leu Ile Gln Pro Phe
340 345 350

Glu Ile Leu Phe Glu Gly Val Glu Tyr Ile Ala Arg Ala Gly Trp Thr
355 360 365

Pro Glu Gly Lys Tyr Ala Trp Ser Ile Leu Leu Asp Arg Ser Gln Thr
370 375 380

Arg Leu Gln Ile Val Leu Ile Ser Pro Glu Leu Phe Ile Pro Val Glu
385 390 395 400

Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro Asp Ser
405 410 415

Val Thr Pro Leu Ile Ile Tyr Glu Glu Thr Thr Asp Ile Trp Ile Asn
420 425 430

Ile His Asp Ile Phe His Val Phe Pro Gln Ser His Glu Glu Glu Ile
435 440 445

Glu Phe Ile Phe Ala Ser Glu Cys Lys Thr Gly Phe Arg His Leu Tyr
450 455 460

Lys Ile Thr Ser Ile Leu Lys Glu Ser Lys Tyr Lys Arg Ser Ser Gly
465 470 475 480

Gly Leu Pro Ala Pro Ser Asp Phe Lys Cys Pro Ile Lys Glu Glu Ile
485 490 495

Ala Ile Thr Ser Gly Glu Trp Glu Val Leu Gly Arg His Gly Ser Asn
500 505 510

Ile Gln Val Asp Glu Val Arg Arg Leu Val Tyr Phe Glu Gly Thr Lys
515 520 525

Asp Ser Pro Leu Glu His His Leu Tyr Val Val Ser Tyr Val Asn Pro
530 535 540

Gly Glu Val Thr Arg Leu Thr Asp Arg Gly Tyr Ser His Ser Cys Cys
545 550 555 560

Ile Ser Gln His Cys Asp Phe Phe Ile Ser Lys Tyr Ser Asn Gln Lys
565 570 575

Asn Pro His Cys Val Ser Leu Tyr Lys Leu Ser Ser Pro Glu Asp Asp
580 585 590

Pro Thr Cys Lys Thr Lys Glu Phe Trp Ala Thr Ile Leu Asp Ser Val
595 600 605

Leu Arg Cys Ser Trp
610



<210> 22 

<211> 4685 

<212> DNA 

<213> Homo sapiens 

<400> 22 

aagtgctaaa gcctccgagg ccaaggccgc tgctactgcc gccgctgctt cttagtgccg 60 

cgttcgccgc ctgggttgtc accggcgccg ccgccgagga agccactgca accaggaccg 120 

gagtggaggc ggcgcagcat gaagcggcgc aggcccgctc catagcgcac gtcgggacgg 180

tccgggcggg gccgggggga aggaaaatgc aacatggcag cagcaatgga aacagaacag 240

ctgggtgttg agatatttga aactgcggac tgtgaggaga atattgaatc acaggatcgg 300 

cctaaattgg agccttttta tgttgagcgg tattcctgga gtcagcttaa aaagctgctt 360 

gccgatacca gaaaatatca tggctacatg atggctaagg caccacatga tttcatgttt 420 

gtgaagagga atgatccaga tggacctcat tcagacagaa tctattacct tgccatgtct 480

ggtgagaaca gagaaaatac actgttttat tctgaaattc ccaaaactat caatagagca 540

gcagtcttaa tgctctcttg gaagcctctt ttggatcttt ttcaggcaac actggactat 600 

ggaatgtatt ctcgagaaga agaactatta agagaaagaa aacgcattgg aacagtcgga 660 

attgcttctt acgattatca ccaaggaagt ggaacatttc tgtttcaagc cggtagtgga 720 

atttatcacg taaaagatgg agggccacaa ggatttacgc aacaaccttt aaggcccaat 780 

ctagtggaaa ctagttgtcc caacatacgg atggatccaa aattatgccc tgctgatcca 840 

gactggattg cttttataca tagcaacgat atttggatat ctaacatcgt aaccagagaa 900 

gaaaggagac tcacttatgt gcacaatgag ctagccaaca tggaagaaga tgccagatca 960 

gctggagtcg ctacctttgt tctccaagaa gaatttgata gatattctgg ctattggtgg 1020

tgtccaaaag ctgaaacaac tcccagtggt ggtaaaattc ttagaattct atatgaagaa 1080 

aatgatgaat ctgaggtgga aattattcat gttacatccc ctatgttgga aacaaggagg 1140 

gcagattcat tccgttatcc taaaacaggt acagcaaatc ctaaagtcac ttttaagatg 1200 

tcagaaataa tgattgatgc tgaaggaagg atcatagatg tcatagataa ggaactaatt 1260 

caaccttttg agattctatt tgaaggagtt gaatatattg ccagagctgg atggactcct 1320 

gagggaaaat atgcttggtc catcctacta gatcgctccc agactcgcct acagatagtg 1380 

ttgatctcac ctgaattatt tatcccagta gaagatgatg ttatggaaag gcagagactc 1440 

attgagtcag tgcctgattc tgtgacgcca ctaattatct atgaagaaac aacagacatc 1500

tggataaata tccatgacat ctttcatgtt tttccccaaa gtcacgaaga ggaaattgag 1560 

tttatttttg cctctgaatg caaaacaggt ttccgtcatt tatacaaaat tacatctatt 1620 

ttaaaggaaa gcaaatataa acgatccagt ggtgggctgc ctgctccaag tgatttcaag 1680 

tgtcctatca aagaggagat agcaattacc agtggtgaat gggaagttct tggccggcat 1740

ggatctaata tccaagttga tgaagtcaga aggctggtat attttgaagg caccaaagac 1800 

tcccctttag agcatcacct gtacgtagtc agttacgtaa atcctggaga ggtgacaagg 1860 

ctgactgacc gtggctactc acattcttgc tgcatcagtc agcactgtga cttctttata 1920 

agtaagtata gtaaccagaa gaatccacac tgtgtgtccc tttacaagct atcaagtcct 1980 

gaagatgacc caacttgcaa aacaaaggaa ttttgggcca ccattttgga ttcagtcctc 2040 

aggtgcagtt ggtgaataat cggtttaaag gagtcaagta tttccgcttg aataccctag 2100 

cctctctagg ttatgtggtt gtagtgatag acaacagggg atcctgtcac cgagggctta 2160 

aatttgaagg cgcctttaaa tataaaatgg gtcaaataga aattgacgat caggtggaag 2220 

gactccaata tctagcttct cgatatgatt tcattgactt agatcgtgtg ggcatccacg 2280 

gctggtccta tggaggatac ctctccctga tggcattaat gcagaggtca gatatcttca 2340 

gggttgctat tgctggggcc ccagtcactc tgtggatctt ctatgataca ggatacacgg 2400 

aacgttatat gggtcaccct gaccagaatg aacagggcta ttacttagga tctgtggcca 2460 

tgcaagcaga aaagttcccc tctgaaccaa atcgtttact gctcttacat ggtttcctgg 2520 

atgagaatgt ccattttgca cataccagta tattactgag ttttttagtg agggctggaa 2580 

agccatatga tttacagatc tatcctcagg agagacacag cataagagtt cctgaatcgg 2640 

gagaacatta tgaactgcat cttttgcact accttcaaga aaaccttgga tcacgtattg 2700 

ctgctctaaa agtgatataa ttttgacctg tgtagaactc tctggtatac actggctatt 2760 

taaccaaatg aggaggttta atcaacagaa aacacagaat tgatcatcac attttgatac 2820 
ctgccatgta acatctactc ctgaaaataa atgtggtgcc atgcaggggt ctacggtttg 2880

tggtagtaat ctaatacctt aaccccacat gctcaaaatc aaatgataca tattcctgag 2940

agacccagca ataccataag aattactaaa aaaaaaaaaa aaaaaaagac attagcacca 3000

tgtattcata ctaccctatt ttcactttta atagtattat aaacttcatg aacttaatta 3060

gtgtattttt acagtatact tttgagtttg ttaaaatatg atgatattag tgattggttt 3120

ggttcagttc cagaatcttt gactagttac agatttgata gcacttaaat gtaattgaat 3180

agcttatgct tcattgcttg ggcatatcca gcatgttatg aactaataac tattaaactt 3240

gacttaacca gtcattcatt aataattttt caaggataac ttagtggcct cctaaagaca 3300

cttgttttgg cactgaccag tttttagcca atttaatctg tatctagtat aaataattct 3360

catttttctt tgatgatatt aacagagtgg gcttttcctt ttgcataaag gctagtaact 3420

gtatatgtag catggattta attagtcatg atattgataa ttacaggcag aaaattttta 3480

atcaaatgat tagagcttaa atatttgcag gcaagttttt ttttttcctt taagaaaagg 3540

aaaaagtaca cattcactag aattcttcag aaaatttagt ggtgccagtt tccatttggt 3600

atttccttat taaaatattc tagaatttta aggagattga agggaatcac agtggggtgg 3660

ggagacctgg gtttggggaa tgacagagag aagaggtggt gagggcctga ttaaaaacta 3720

agcagaagta gttttaacaa aaatactcat gaaaatgttt ggaaactgaa atttaaacaa 3780

ctgtaatatt aaggaaacca gaatcaataa atcactgtct tgccagcaca gctacagagt 3840

aacatgattc aggggaggaa aagttcctta gagttacttt tataattctt tttttttttc 3900

ctcttaggtt tagaaatctt acaaatttaa actttatcct tttaaaatta tttgaacata 3960

atttagatat tgtaagctta aaatacaaat gtttatagat aacctcttta ccataaacta 4020

atccctggca agccatggct ctcttttttt ttttggtgtt taaagcctgt aaacagtttt 4080

tctgaatgat catgaacttt tcttggttta gcactaggat ttagctatga agagagctca 4140

taggctttca ggtgctaatt gagatctgcc ctgttagagt cttggggtgc tagattggtc 4200

acattgacac cagtggcagg gaaggcatct atgagtttga tgctttttat cacacacttc 4260

agtgtttaga aagttattac caatactttt aaacaacact ccaagaaaat ttgctatatt 4320

tctttctcat cactacagag agagtagatt tccccataga gagcacagcc tccattagta 4380

aggttggtga ctattggtaa gaggtggact tcattgacac caagtgggag gtagggaaag 4440

cccagaaatg gcaggatgat atggtggttc tgtcgttggg aaaggtattg ggttttgctg 4500

tttgtattta tactgtataa tagataccac gctttttctt attatctgta tatgtattgc 4560

ttttcatgtt tgatattttc ccatgccaag atttgtttat atatattttc aatgttaaat 4620

taaattgatt tgggtaactt tcttccccaa gaaagtattt tcccccttaa gtataaatct 4680

gactg 4685

<210> 23

<211> 892

<212> PRT

<213> Homo sapiens

<400> 23

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser
1 5 10 15

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr
20 25 30

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp
35 40 45

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg
50 55 60

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys
65 70 75 80

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro
85 90 95

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu
100 105 110

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala
115 120 125

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr
130 135 140

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg
145 150 155 160

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu
165 170 175
Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg
180 185 190

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu
195 200 205

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro
210 215 220

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val
225 230 235 240

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln
245 250 255

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr
260 265 270

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys
275 280 285

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile
290 295 300

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro
305 310 315 320

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg
325 330 335

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln
340 345 350

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val
355 360 365

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala
370 375 380

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg
385 390 395 400

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile
405 410 415

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val
420 425 430

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val
435 440 445

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly
450 455 460

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe
465 470 475 480

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp
485 490 495

Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys Pro Ile
500 505 510

Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu Ala Arg
515 520 525

His Gly Ser Lys Ile Trp Val Asn Glu Glu Thr Lys Leu Val Tyr Phe
530 535 540

Gln Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr Val Val Ser
545 550 555 560

Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro Gly Phe Ser
565 570 575

His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val Ser His Tyr
580 585 590

Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys Leu Ser Gly
595 600 605

Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp Ala Ser Met
610 615 620

Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro Glu Ile Phe
625 630 635 640

His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met Ile Tyr Lys
645 650 655

Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val Leu Phe Val
660 665 670

Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe Lys Gly Ile
675 680 685

Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr Ala Val Val
690 695 700

Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg Phe Glu Gly
705 710 715 720

Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp Gln Val Glu
725 730 735

Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp Leu Ser Arg
740 745 750

Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser Leu Met Gly
755 760 765

Leu Ile His Lys Pro Gln Val Phe Lys Val Ala Ile Ala Gly Ala Pro
770 775 780

Val Thr Val Trp Met Ala Tyr Asp Thr Gly Tyr Thr Glu Arg Tyr Met
785 790 795 800

Asp Val Pro Glu Asn Asn Gln His Gly Tyr Glu Ala Gly Ser Val Ala
805 810 815

Leu His Val Glu Lys Leu Pro Asn Glu Pro Asn Arg Leu Leu Ile Leu
820 825 830

His Gly Phe Leu Asp Glu Asn Val His Phe Phe His Thr Asn Phe Leu
835 840 845

Val Ser Gln Leu Ile Arg Ala Gly Lys Pro Tyr Gln Leu Gln Ile Tyr
850 855 860

Pro Asn Glu Arg His Ser Ile Arg Cys Pro Glu Ser Gly Glu His Tyr
865 870 875 880

Glu Val Thr Leu Leu His Phe Leu Gln Glu Tyr Leu
885 890



<210> 24 

<211> 4302 

<212> DNA 

<213> Homo sapiens

<400> 24 



caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 
ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 

ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460 

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520 

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gtggccatcg cgggtgcccc ggtcaccgtc 2640 

tggatggcct acgacacagg gtacactgag cgctacatgg acgtccctga gaacaaccag 2700 

cacggctatg aggcgggttc cgtggccctg cacgtggaga agctgcccaa tgagcccaac 2760

cgcttgctta tcctccacgg cttcctggac gaaaacgtgc actttttcca cacaaacttc 2820 

ctcgtctccc aactgatccg agcagggaaa ccttaccagc tccagatcta ccccaacgag 2880 

agacacagta ttcgctgccc cgagtcgggc gagcactatg aagtcacgtt gctgcacttt 2940

ctacaggaat acctctgagc ctgcccaccg ggagccgcca catcacagca caagtggctg 3000

cagcctccgc ggggaaccag gcgggaggga ctgagtggcc cgcgggcccc agtgaggcac 3060

tttgtcccgc ccagcgctgg ccagccccga ggagccgctg ccttcaccgc cccgacgcct 3120

tttatccttt tttaaacgct cttgggtttt atgtccgctg cttcttggtt gccgagacag 3180 

agagatggtg gtctcgggcc agcccctcct ctccccgcct tctgggagga ggaggtcaca 3240

cgctgatggg cactggagag gccagaagag actcagagga gcgggctgcc ttccgcctgg 3300

ggctccctgt gacctctcag tcccctggcc cggccagcca ccgtccccag cacccaagca 3360 

tgcaattgcc tgtccccccc ggccagcctc cccaacttga tgtttgtgtt ttgtttgggg 3420 

ggatattttt cataattatt taaaagacag gccgggcgcg gtggctcacg tctgtaatcc 3480

cagcactttg ggaggctgag gcgggcggat cacctgaggt tgggagttca agaccagcct 3540

ggccaacatg gggaaacccc gtctctacta aaaatacaaa aaattagccg ggtgtggtgg 3600

cgcgtgccta taatcccagc tactcgggag gctgaggcag gagaatcgct tgaacccggg 3660

aggtggaggt tgcggtgagc caagatcgca ccattgcact ccagcctggg caacaagagc 3720

gaaactctgt ctcaaaataa ataaaaaata aaagacagaa agcaaggggt gcctaaatct 3780

agacttgggg tccacaccgg gcagcggggt tgcaacccag cacctggtag gctccatttc 3840

ttcccaagcc cgagcagagg gtcatgcggg ccccacagga gaagcggcca gggcccgcgg 3900

ggggcaccac ctgtggacag ccctcctgtc cccaagcttt caggcaggca ctgaaacgca 3960

ccgaacttcc acgctctgct ggtcagtggc ggctgtcccc tccccagccc agccgcccag 4020

ccacatgtgt ctgcctgacc cgtacacacc aggggttccg gggttgggag ctgaaccatc 4080

cccacctcag ggttatattt ccctctcccc ttccctcccc gccaagagct ctgccagggg 4140

cgggcaaaaa aaaaagtaaa aagaaaagaa aaaaaaaaaa aagaaacaaa ccacctctac 4200 

atattatgga aagaaaatat ttttgtcgat tcttattctt ttataattat gcgtggaaga 4260 

agtagacaca ttaaacgatt ccagttggaa acatgtcacc tg 4302 

<210> 25 

<211> 518 

<212> PRT 

<213> Homo sapiens 

<400> 25 



Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser 

1 5 10 15 

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr 
20 25 30

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp
35 40 45

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg
50 55 60

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys
65 70 75 80

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro
85 90 95

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu
100 105 110

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala
115 120 125

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr
130 135 140

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg
145 150 155 160

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu
165 170 175

Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg
180 185 190

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu
195 200 205

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro
210 215 220

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val
225 230 235 240

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln
245 250 255

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr
260 265 270

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys
275 280 285

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile
290 295 300

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro
305 310 315 320

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg
325 330 335

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln
340 345 350

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val
355 360 365

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala
370 375 380

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg
385 390 395 400

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile
405 410 415

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val
420 425 430

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val
435 440 445

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly
450 455 460

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe
465 470 475 480

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp
485 490 495

Trp Ser Glu Pro Phe Ser Pro Gly Glu Gly Glu Gln Ser Leu Thr Asn
500 505 510

Ala Val Asp Ser Ser Arg
515


<210> 26

<211> 2411

<212> DNA

<213> Homo sapiens

<400> 26



c.g9ccgc==9 cctgggtcgc tcaacttccg gg^t f^^^ l^^H 
t9tgtccg=c gcgectgtog tcccccgctc -a-^Jtc =9ggg_^^g^^ fggccgccgc 

— iS ~ ~ ?SgS"4 

-I gg ! S g?cgSr4 ^.5^^= 

^^^^oac?a cccqcttcS ggtgcagaal cactcgtggg acgggctccg gagcatcatc 
gacccggccg ^^^atac" ||gcctcatt gtcaacaagg cgccccacga cttccagttt 
cacggcagcc 3caagtactc 999^ tcccaccgcc tctactacct gggaatgcca 

gtgcagaaga cgQ^^gagtc ^gggc tctgagattc ccaagaaggt ccggaaagag 

""TraTac ?cctgtcctg gaagcagatg ctggaLatt tccaggccac gccccaccat 
^aoatcta?t ctcgSaggI Igalctlctl agggagcgga aacgcctggg 99tcttcggc 
ggggtctact ^^cgggagg ^ ggcctcttcc tcttccaggc cagcaacagc 

atcacctcct acgacttcca cagcgagag 9^ tgtcccctat gaaaccgctg 

ctcttccact gccgcgacgg cggcaagaac 9gttcatgg J^^^^^^^^ tgccgaccct 

^""tllllTt cctJca?caa tSSgcga? ct|?gggtgg ccaacatcga gacaggcgag 
gccttcttct ccttcatcaa 9 9 ^tatccaatg tcctggatga ccccaagtct 

gagcggcggc ^gaccttctg miH^ gagttcgacc gcttcactgg gtactggtgg 
gcgggtgtgg ^^^^^"^^t ^^^acaggaa 9 9 9 g^^^ cctgtatgag 

tgccccacag cctcctggga ^ggtccagag 99 ^ ctcctgcgct agaagaaagg 

gaagtcgatg agtccgaggt ggaggcatt cacgtcccc^ ,,,,i4at cgccttgaaa 
aagacggact ^^^atcggta ^^^^^gg 99 9 ^ ^^^^^gga gaaggagctg 

ctggctgagt "cagactga "9ccagggc 9 9 tcgccagggc cgggtggacc 
^^^"TaTcl aaScgcctg Igccat|tt! ct^accggc cccagcagtg gctccagctc 
IflltclTcc ccccglccc? I?tcat?ccg agcacagaga atgaggagca g^ggctagcc 
fc^e^farag f.gf c J ^ . c.gta.^^^^^^ .g™^ SS^— 
gtctggatca atgttcatga ^atcttctac gccatttgta caaagtcacc 

--"i i! sfss 

IZll^ll l%%illfc Sctcagcc gcacaccctg cgggagaagg aacaggga.c 
ggcaggaagc cagccttccc <=s9t9«"9C "?""9gc ^?|| c=ttagca=. 
gttggcttat tcaggcagca S.ttlactga 9=«=w^= cccLcallt gtgttcataa 

'r*!S?ftt llllTolTs gccaeagfc SSttlc ag.gactgtc .gtggtcgct 

J c jgg.a « ccagcagaaa c.g«g.gc. .ggccaa.- 

ttactgtcta gccctttgta gaaacatttg ^^J9^ ^ ttgggacctt ctagtggagg 
TillllT. ""c lllTI^^ 9tf=?gtggg tggcag.tgc 

agatcctagt c 



<210> 27 

<211> 892 

<212> PRT 

<213> Homo sapiens 

<400> 27 






Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser
1 5 10 15

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr
20 25 30

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp
35 40 45

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg
50 55 60

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys
65 70 75 80

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro
85 90 95

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu
100 105 110

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala
115 120 125

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr
130 135 140

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg
145 150 155 160

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu
165 170 175

Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg
180 185 190

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu
195 200 205

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro
210 215 220

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val
225 230 235 240

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln
245 250 255

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr
260 265 270

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys
275 280 285

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile
290 295 300

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro
305 310 315 320

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg
325 330 335

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln
340 345 350

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val
355 360 365

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala
370 375 380

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg
385 390 395 400

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile
405 410 415

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val
420 425 430

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val
435 440 445

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly
450 455 460

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe
465 470 475 480

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp
485 490 495

Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys Pro Ile
500 505 510

Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu Ala Arg
515 520 525

His Gly Ser Lys Ile Trp Val Asn Glu Glu Thr Lys Leu Val Tyr Phe
530 535 540

Gln Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr Val Val Ser
545 550 555 560

Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro Gly Phe Ser
565 570 575

His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val Ser His Tyr
580 585 590

Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys Leu Ser Gly
595 600 605

Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp Ala Ser Met
610 615 620

Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro Glu Ile Phe
625 630 635 640

His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met Ile Tyr Lys
645 650 655

Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val Leu Phe Val
660 665 670

Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe Lys Gly Ile
675 680 685

Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr Ala Val Val
690 695 700

Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg Phe Glu Gly
705 710 715 720

Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp Gln Val Glu
725 730 735

Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp Leu Ser Arg
740 745 750

Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser Leu Met Gly
755 760 765

Leu Ile His Lys Pro Gln Val Phe Lys Val Ala Ile Ala Gly Ala Pro
770 775 780

Val Thr Val Trp Met Ala Tyr Asp Thr Gly Tyr Thr Glu Arg Tyr Met
785 790 795 800

Asp Val Pro Glu Asn Asn Gln His Gly Tyr Glu Ala Gly Ser Val Ala
805 810 815

Leu His Val Glu Lys Leu Pro Asn Glu Pro Asn Arg Leu Leu Ile Leu
820 825 830

His Gly Phe Leu Asp Glu Asn Val His Phe Phe His Thr Asn Phe Leu
835 840 845

Val Ser Gln Leu Ile Arg Ala Gly Lys Pro Tyr Gln Leu Gln Ile Tyr
850 855 860

Pro Asn Glu Arg His Ser Ile Arg Cys Pro Glu Ser Gly Glu His Tyr
865 870 875 880

Glu Val Thr Leu Leu His Phe Leu Gln Glu Tyr Leu
885 890

<210> 28 

<211> 4219 

<212> DNA 

<213> Homo sapiens 

<400> 28 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 

ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460 

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520 

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gtggccatcg cgggtgcccc ggtcaccgtc 2640 

tggatggcct acgacacagg gtacactgag cgctacatgg acgtccctga gaacaaccag 2700 

cacggctatg aggcgggttc cgtggccctg cacgtggaga agctgcccaa tgagcccaac 2760 

cgcttgctta tcctccacgg cttcctggac gaaaacgtgc actttttcca cacaaacttc 2820 

ctcgtctccc aactgatccg agcagggaaa ccttaccagc tccagatcta ccccaacgag 2880 

agacacagta ttcgctgccc cgagtcgggc gagcactatg aagtcacgtt gctgcacttt 2940 
ctacaggaat acctctgagc ctgcccaccg ggagccgcca catcacagca caagtggctg 

igcS icsicis =i csrjg 
^"Hsc ar/cS^^^ ssscS r.sss 

llTaltlll cactglSag gccagaagag actcagagga gcgggctgcc ttccgcctgg 
cgctgatggg ^^^^SSja ^ ^ ^ ^ cggccagcca ccgtccccag cacccaagca 
ggctccctgt g^^^ctcag tcccctgg 99^^^9 tgtttgtgtt ttgtttgggg 

aStatt?tt cJta^tt^tt ?Iaaaiacag gccgggcgcg gtggctcacg tctgtaatcc 
^^^^=^h^ta aaaqqctgag gcgggcggat cacctgaggt tgggagttca agaccagcct 
alTcllclll lllllTcclc |tc?ctlcta aaaatacaaa aaattagccg ggtgtggtgg 
Igcgtgccta ?Stcccagc Lctcgggag gctgaggcag gagaatcgct tgaacccggg 
l^lfnalaat tQcgqtgagc caagatcgca ccattgcact ccagcctggg caacaagagc 
aSac?c?|t cIcSaltL atalaaaata aaagacagaa agcaaggggt 9^^^^^^^^ 
^^^llltl ?ccacaccgg gcagcggggt tgcaacccag cacctggtag gctccatttc 
agacttgggg ^ccacaccgg ^l^cgcaccg aacttccacg ctctgctggt 

"'r^Saac tPtcccctc? ?ca|cccagc cgcccagcc! catgtgtctg cctgacccgt 

i ^icS^i aSf^i 

iiii — ?2 =r4 r= ssf.^^. 

gttggaaaca tgtcacctg 

3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4219 

<210> 29 

<211> 832 

<212> PRT 

<213> Homo sapiens 

<400> 29 

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser 
1 5 10 15 

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr 
20 25 30 

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp 
35 40 45 

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg 
50 55 60 

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys 
65 70 75 80 

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro 
85 90 95 

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu 
100 105 110 

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala 
115 120 125 

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr 
130 135 140 

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg 
145 150 155 160 

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu 
165 170 175 

Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg 
180 185 190 

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu 
195 200 205 

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro 
210 215 220 

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val 
225 230 235 240 

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln 
245 250 255 

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr 
260 265 270 

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys 
275 280 285 

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile 
290 295 300 

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro 
305 310 315 320 

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg 
325 330 335 

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln 
340 345 350 

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val 
355 360 365 

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala 
370 375 380 

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg 
385 390 395 400 

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile 
405 410 415 

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val 
420 425 430 

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val 
435 440 445 

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly 
450 455 460 

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe 
465 470 475 480 

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp 
485 490 495 

Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys Pro Ile 
500 505 510 

Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu Ala Arg 
515 520 525 

His Gly Ser Lys Ile Trp Val Asn Glu Glu Thr Lys Leu Val Tyr Phe 
530 535 540 

Gln Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr Val Val Ser 
545 550 555 560 

Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro Gly Phe Ser 
565 570 575 

His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val Ser His Tyr 
580 585 590 

Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys Leu Ser Gly 
595 600 605 

Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp Ala Ser Met 
610 615 620 

Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro Glu Ile Phe 
625 630 635 640 

His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met Ile Tyr Lys 
645 650 655 

Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val Leu Phe Val 
660 665 670 

Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe Lys Gly Ile 
675 680 685 

Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr Ala Val Val 
690 695 700 

Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg Phe Glu Gly 
705 710 715 720 

Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp Gln Val Glu 
725 730 735 

Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp Leu Ser Arg 
740 745 750 

Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser Leu Met Gly 
755 760 765 

Leu Ile His Lys Pro Gln Val Phe Lys Ala Gln Pro Leu Ala Tyr Pro 
770 775 780 

Pro Arg Leu Pro Gly Arg Lys Arg Ala Leu Phe Pro His Lys Leu Pro 
785 790 795 800 

Arg Leu Pro Thr Asp Pro Ser Arg Glu Thr Leu Pro Ala Pro Asp Leu 
805 810 815 

Pro Gln Arg Glu Thr Gln Tyr Ser Leu Pro Arg Val Gly Arg Ala Leu 
820 825 830 







<210> 30 

<211> 4159 

<212> DNA 

<213> Homo sapiens 

<400> 30 



caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 
ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460 

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520 

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gcccaaccgc ttgcttatcc tccacggctt 2640 

cctggacgaa aacgtgcact ttttccacac aaacttcctc gtctcccaac tgatccgagc 2700 

agggaaacct taccagctcc agatctaccc caacgagaga cacagtattc gctgccccga 2760 

gtcgggcgag cactatgaag tcacgttgct gcactttcta caggaatacc tctgagcctg 2820 

cccaccggga gccgccacat cacagcacaa gtggctgcag cctccgcggg gaaccaggcg 2880 

ggagggactg agtggcccgc gggccccagt gaggcacttt gtcccgccca gcgctggcca 2940 

gccccgagga gccgctgcct tcaccgcccc gacgcctttt atcctttttt aaacgctctt 3000 

gggttttatg tccgctgctt cttggttgcc gagacagaga gatggtggtc tcgggccagc 3060 

ccctcctctc cccgccttct gggaggagga ggtcacacgc tgatgggcac tggagaggcc 3120 

agaagagact cagaggagcg ggctgccttc cgcctggggc tccctgtgac ctctcagtcc 3180 

cctggcccgg ccagccaccg tccccagcac ccaagcatgc aattgcctgt cccccccggc 3240 

cagcctcccc aacttgatgt ttgtgttttg tttgggggga tatttttcat aattatttaa 3300 

aagacaggcc gggcgcggtg gctcacgtct gtaatcccag cactttggga ggctgaggcg 3360 

ggcggatcac ctgaggttgg gagttcaaga ccagcctggc caacatgggg aaaccccgtc 3420 

tctactaaaa atacaaaaaa ttagccgggt gtggtggcgc gtgcctataa tcccagctac 3480 

tcgggaggct gaggcaggag aatcgcttga acccgggagg tggaggttgc ggtgagccaa 3540 

gatcgcacca ttgcactcca gcctgggcaa caagagcgaa actctgtctc aaaataaata 3600 

aaaaaCaaaa gacagaaagc aaggggtgcc taaatctaga cttggggtcc acaccgggca 3660 

gcggggttgc aacccagcac ctggtaggct ccatttcttc ccaagcccga gcagagggtc 3720 

atgcgggccc cacaggagaa gcggccaggg cccgcggggg gcaccacctg tggacagccc 3780 

tcctgtcccc aagctttcag gcaggcactg aaacgcaccg aacttccacg ctctgctggt 3840 

cagtggcggc tgtcccctcc ccagcccagc cgcccagcca catgtgtctg cctgacccgt 3900 

acacaccagg ggttccgggg ttgggagctg aaccatcccc acctcagggt tatatttccc 3960 

tctccccttc cctccccgcc aagagctctg ccaggggcgg gcaaaaaaaa aagtaaaaag 4020 

aaaagaaaaa aaaaaaaaag aaacaaacca cctctacata ttatggaaag aaaatatttt 4080 

tgtcgattct tattctttta taattatgcg tggaagaagt agacacatta aacgattcca 4140 

gttggaaaca tgtcacctg 4159 



<210> 31 

<211> 832 

<212> PRT 

<213> Homo sapiens 

<400> 31 

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser 
1 5 10 15 

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr 
20 25 30 

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp 
35 40 45 

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg 
50 55 60 

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys 
65 70 75 80 

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro 
85 90 95 

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu 
100 105 110 

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala 
115 120 125 

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr 
130 135 140 

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg 
145 150 155 160 

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu 
165 170 175 

Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg 
180 185 190 

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu 
195 200 205 

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro 
210 215 220 

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val 
225 230 235 240 

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln 
245 250 255 

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr 
260 265 270 

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys 
275 280 285 

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile 
290 295 300 

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro 
305 310 315 320 

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg 
325 330 335 

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln 
340 345 350 

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val 
355 360 365 

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala 
370 375 380 

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg 
385 390 395 400 

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile 
405 410 415 

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val 
420 425 430 

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val 
435 440 445 

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly 
450 455 460 

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe 
465 470 475 480 

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp 
485 490 495 

Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys Pro Ile 
500 505 510 

Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu Ala Arg 
515 520 525 

His Gly Ser Lys Ile Trp Val Asn Glu Glu Thr Lys Leu Val Tyr Phe 
530 535 540 

Gln Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr Val Val Ser 
545 550 555 560 

Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro Gly Phe Ser 
565 570 575 

His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val Ser His Tyr 
580 585 590 

Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys Leu Ser Gly 
595 600 605 

Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp Ala Ser Met 
610 615 620 

Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro Glu Ile Phe 
625 630 635 640 

His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met Ile Tyr Lys 
645 650 655 

Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val Leu Phe Val 
660 665 670 

Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe Lys Gly Ile 
675 680 685 

Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr Ala Val Val 
690 695 700 

Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg Phe Glu Gly 
705 710 715 720 

Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp Gln Val Glu 
725 730 735 

Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp Leu Ser Arg 
740 745 750 

Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser Leu Met Gly 
755 760 765 

Leu Ile His Lys Pro Gln Val Phe Lys Ala Gln Pro Leu Ala Tyr Pro 
770 775 780 

Pro Arg Leu Pro Gly Arg Lys Arg Ala Leu Phe Pro His Lys Leu Pro 
785 790 795 800 

Arg Leu Pro Thr Asp Pro Ser Arg Glu Thr Leu Pro Ala Pro Asp Leu 
805 810 815 

Pro Gln Arg Glu Thr Gln Tyr Ser Leu Pro Arg Val Gly Arg Ala Leu 
820 825 830 







<210> 32 

<211> 4076 

<212> DNA 

<213> Homo sapiens 

<400> 32 



caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120 

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180 

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300 

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360 

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600 

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780 

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960 

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020 

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080 

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140 

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200 

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260 

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380 

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440 

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560 

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620 

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740 

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800 

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 

aggcacggct ccaagatctg ggtcaatgag gagaccaagc tggtgtactt ccagggcacc 1920 

aaggacacgc cgctggagca ccacctctac gtggtcagct atgaggcggc cggcgagatc 1980 

gtacgcctca ccacgcccgg cttctcccat agctgctcca tgagccagaa cttcgacatg 2040 

ttcgtcagcc actacagcag cgtgagcacg ccgccctgcg tgcacgtcta caagctgagc 2100 

ggccccgacg acgaccccct gcacaagcag ccccgcttct gggctagcat gatggaggca 2160 

gccagctgcc ccccggatta tgttcctcca gagatcttcc atttccacac gcgctcggat 2220 

gtgcggctct acggcatgat ctacaagccc cacgccttgc agccagggaa gaagcacccc 2280 

accgtcctct ttgtatatgg aggcccccag gtgcagctgg tgaataactc cttcaaaggc 2340 

atcaagtact tgcggctcaa cacactggcc tccctgggct acgccgtggt tgtgattgac 2400 

ggcaggggct cctgtcagcg agggcttcgg ttcgaagggg ccctgaaaaa ccaaatgggc 2460 

caggtggaga tcgaggacca ggtggagggc ctgcagttcg tggccgagaa gtatggcttc 2520 

atcgacctga gccgagttgc catccatggc tggtcctacg ggggcttcct ctcgctcatg 2580 

gggctaatcc acaagcccca ggtgttcaag gcccaaccgc ttgcttatcc tccacggctt 2640 

cctggacgaa aacgtgcact ttttccacac aaacttcctc gtctcccaac tgatccgagc 2700 

agggaaacct taccagctcc agatctaccc caacgagaga cacagtattc gctgccccga 2760 

gtcgggcgag cactatgaag tcacgttgct gcactttcta caggaatacc tctgagcctg 2820 

cccaccggga gccgccacat cacagcacaa gtggctgcag cctccgcggg gaaccaggcg 2880 

ggagggactg agtggcccgc gggccccagt gaggcacttt gtcccgccca gcgctggcca 2940 

gccccgagga gccgctgcct tcaccgcccc gacgcctttt atcctttttt aaacgctctt 3000 

gggttttatg tccgctgctt cttggttgcc gagacagaga gatggtggtc tcgggccagc 3060 

ccctcctctc cccgccttct gggaggagga ggtcacacgc tgatgggcac tggagaggcc 3120 

agaagagact cagaggagcg ggctgccttc cgcctggggc tccctgtgac ctctcagtcc 3180 

cctggcccgg ccagccaccg tccccagcac ccaagcatgc aattgcctgt cccccccggc 3240 

cagcctcccc aacttgatgt ttgtgttttg tttgggggga tatttttcat aattatttaa 3300 

aagacaggcc gggcgcggtg gctcacgtct gtaatcccag cactttggga ggctgaggcg 3360 

ggcggatcac ctgaggttgg gagttcaaga ccagcctggc caacatgggg aaaccccgtc 3420 

tctactaaaa atacaaaaaa ttagccgggt gtggtggcgc gtgcctataa tcccagctac 3480 

tcgggaggct gaggcaggag aatcgcttga acccgggagg tggaggttgc ggtgagccaa 3540 

gatcgcacca ttgcactcca gcctgggcaa caagagcgaa actctgtctc aaaataaata 3600 

aaaaataaaa gacagaaagc aaggggtgcc taaatctaga cttggggtcc acaccgggca 3660 

gcggggttgc aacccagcac ctggtaggct ccatttcttc ccaagcccga ctttcaggca 3720 

ggcactgaaa cgcaccgaac ttccacgctc tgctggtcag tggcggctgt cccctcccca 3780 

gcccagccgc ccagccacat gtgtctgcct gacccgtaca caccaggggt tccggggttg 3840 

ggagctgaac catccccacc tcagggttat atttccctct ccccttccct ccccgccaag 3900 

agctctgcca ggggcgggca aaaaaaaaag taaaaagaaa agaaaaaaaa aaaaaagaaa 3960 

caaaccacct ctacatatta tggaaagaaa atatttttgt cgattcttat tcttttataa 4020 

ttatgcgtgg aagaagtaga cacattaaac gattccagtt ggaaacatgt cacctg 4076 



<210> 33 

<211> 879 

<212> PRT 

<213> Homo sapiens 

<400> 33 

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser 
1 5 10 15 

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr 
20 25 30 

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp 
35 40 45 

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg 
50 55 60 

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys 
65 70 75 80 

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro 
85 90 95 

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu 
100 105 110 

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala 
115 120 125 

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr 
130 135 



Pro 


His 


His 


Gly 


Val 


Tyr 


Ser 


Arg 


Glu 


145 










150 








Lys 


Arg 


Leu 


Gly 


Val 


Phe 


Gly 


Ile 


Thr 










165 










Ser 


Gly 


Leu 


Phe 


Leu 


Phe 


Gln 


Ala 


Ser 








180 










185 


Asp 


Gly 


Gly 


Lys 


Asn 


Gly 


Phe 


Met 


Val 






195 










200 




Ile 


Lys 


Thr 


Gln 


Cys 


Ser 


Gly 


Pro 


Arg 




210 










215 






Ala 


Asp 


Pro 


Ala 


Phe 


Phe 


Ser 


Phe 


Ile 


225 










230 








Ala 


Asn 


Ile 


Glu 


Thr 


Gly 


Glu 


Glu 


Arg 










245 










Gly 


Leu 


Ser 


Asn 


Val 


Leu 


Asp 


Asp 


Pro 








260 










265 


Phe 


Val 


Ile 


Gln 


Glu 


Glu 


Phe 


Asp 


Arg 






275 










280 




Pro 


Thr 


Ala 


Ser 


Trp 


Glu 


Gly 


Ser 


Glu 




290 










295 






Leu 


Tyr 


Glu 


Glu 


Val 


Asp 


Glu 


Ser 


Glu 


305 










310 








Ser 


Pro 


Ala 


Leu 


Glu 


Glu 


Arg 


Lys 


Thr 










325 










Thr 


Gly 


Ser 


Lys 


Asn 


Pro 


Lys 


Ile 


Ala 








340 










345 


Thr 


Asp 


Ser 


Gln 


Gly 


Lys 


Ile 


Val 


Ser 






355 










360 




Gln 


Pro 


Phe 


Ser 


Ser 


Leu 


Phe 


Pro 


Lys 




370 










375 






Gly 


Trp 


Thr 


Arg 


Asp 


Gly 


Lys 


Tyr 


Ala 


385 










390 








Pro 


Gln 


Gln 


Trp 


Leu 


Gln 


Leu 


Val 


Leu 










405 










Pro 


Ser 


Thr 


Glu 


Asn 


Glu 


Glu 


Gln 


Arg 








420 










425 


Pro 


Arg 


Asn 


Val 


Gln 


Pro 


Tyr 


Val 


Val 






435 










440 




Trp 


Ile 


Asn 


Val 


His 


Asp 


Ile 


Phe 


Tyr 




450 










455 






Glu 


Asp 


Glu 


Leu 


Cys 


Phe 


Leu 


Arg 


Ala 


465 










470 








Cys 


His 


Leu 


Tyr 


Lys 


Val 


Thr 


Ala 


Val 










485 










Trp 


Ser 


Glu 


Pro 


Phe 


Ser 


Pro 


Gly 


Glu 








500 










505 


Lys 


Glu 


Glu 


He 


Ala 


Leu 


Thr 


Ser 


Gly 






515 










520 




His 


Gly 


Ser 


Lys 


Gly 


Thr 


Lys 


Asp 


Thr 




530 










535 






Val 


Val 


Ser 


Tyr 


Glu 


Ala 


Ala 


Gly 


Glu 



545 550 
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140 










Glu 


Glu 


Leu 


Leu 


Arg 


Glu Arg 




155 










160 


Ser 


Tyr 


Asp 


Phe 


His 


Ser 


Glu 


170 










175 




Asn 


Ser 


Leu 


Phe 


His 


Cys 


Arg 










190 






Ser 


Pro 


Met 


Lys 


Pro 


Leu 


Glu 








205 








Met 


Asp 


Pro 


Lys 


He 


Cys 


Pro 






220 










Asn 


Asn 


Ser 


Asp 


Leu 


Trp 


Val 




235 










240 


Arg 


Leu 


Thr 


Phe 


Cys 


His 


Gin 


250 










255 




Lys 


Ser 


Ala 


Gly 


Val 


Ala 


Thr 










270 






Phe 


Thr 


Gly 


Tyr 


Trp 


Trp 


Cys 








285 








Gly 


Leu 


Lys 


Thr 


Leu 


Arg 


He 






300 










Val 


Glu 


Val 


He 


His 


Val 


Pro 




315 










320 


Asp 


Ser 


Tyr 


Arg 


Tyr 


Pro 


Arg 


330 










335 




Leu 


Lys 


Leu 


Ala 


Glu 


Phe 


Gin 










350 






Thr 


Gin 


Glu 


Lys 


Glu 


Leu 


Val 








365 








Val 


Glu 


Tyr 


He 


Ala 


Arg 


Ala 






380 










Trp 


Ala 


Met 


Phe 


Leu 


Asp Arg 




395 










400 


Leu 


Pro 


Pro 


Ala 


Leu 


Phe 


He 


410 










415 




Leu 


Ala 


Ser 


Ala 


Arg 


Ala 


Val 










430 






Tyr 


Glu 


Glu 


Val 


Thr 


Asn 


Val 








445 








Pro 


Phe 


Pro 


Gin 


Ser 


Glu 


Gly 






460 










Asn 


Glu 


Cys 


Lys 


Thr 


Gly 


Phe 




475 










480 


Leu 


Lys 


Ser 


Gin 


Gly 


Tyr 


Asp 


490 










495 




Asp 


Glu 


Phe 


Lys 


Cys 


Pro 


He 










510 






Glu 


Trp 


Glu 


Val 


Leu 


Ala 


Arg 








525 








Pro 


Leu 


Glu 


His 


His 


Leu 


Tyr 






540 










He 


val 


Arg 


Leu 


Thr 


Thr 


Pro 




555 










560 
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Glv 


Ptie 


Ser 


His 


Ser 


Cyg 


Ser 


Met 










565 








O JL 


His 


Tvr 
j-y J- 


Ser 


Ser 


Val 


Ser 


Thr 








58 0 










Leu. 


Ser 


Glv 


Pro 


Asp 


Asp 


Asp 


Pro 






s Q R 










6 00 


Ala 


Ser 


Met 


Met 


Glu 


Ala 


Ala 


Ser 




610 










615 




Glu 


lie 


Phe 


His 


Phe 


His 


Thr 


Arq 


62 5 










63 0 






lie 


J. J/ X 


Lys 


Pro 


His 


Ala 


Leu 


Gin 










645 








L6V1 


Phe 


Val 


Tyr Gly 


Glv 


Pro 


Gin 








660 












V3 X 


He 


Lys 


Tyr 


Leu 


Arg 


Leu 






675 










680 


Ala 


val 


Val 


Val 


He 


Asp 


Gly Arg 




tf; Q n 

D ? U 










695 




XT Xi W 


m n 


Gly Ala 


Leu 


Lys 


Asn 


Gin 


T ri c: 










Tin 






Gin 


Val 


Glu 


Gly 


Leu 




Phe 


Val 










725 








Ii€U 


ser 


Arg 


Val 


Ala 


X X c 


His 


Gly 








74 0 










Leu 


Met 


Gly 


Leu 


He 


rixS 


Lys 


Pro 






755 










760 


Gly Ala 


Pro 


Val 


Thr 


V d X 


Trp 


Met 




770 










775 




Arg 


Tyr 


Met 


Asp 


Val 


XT X Vj 


Glu 


Asn 


785 
















Ser 


Val 


Ala 


Leu 


His 


Val 


Glu 


Lys 










805 








Leu 


He 


Leu 


His 


Gly 




Leu 


Asp 








820 










Asn 


Phe 


Leu 


Val 


Ser 


Gin 


Leu 


He 






835 










840 


Gin 


He 


Tyr 


Pro 


Asn 


Glu 


Arg 


His 




850 










855 




Glu 


His 


Tyr 


Glu 


Val 


Thr 


Leu 


Leu 


865 










870 







Ser 


Gin 


Asn 


Phe 


Asp 


Met 


Phe 


Val 




570 














Pro 


Pro 


Cvs 


Val 


His 


Val 


Tvr 
xy X 


Lys 


— ' o 










O 2f \J 






Leu 


His 


Lys 


Gin 


Pro 


A TCI 




irp 










O U 3 












pT*r^ 
Jr X u 




lyr 


V d X 


Pro 


Pro 


















Ser 


Asp 


Val 


Arg 


Leu 


lyx 


vj -1- _y 








O >3 3 










U 


Pro 


Gly 


Lys 


Lys 


His 


Pro 


>1 XXX 


V d X 




rf^: t; n 

O D u 










f c c 

b b D 




Va 1 
V ctx 


G1 n 


Leu 


V CLX 


Asn 


Asn 


Ser 


PVi A 


O D3 










b/U 






Asn 


Thr 


Leu 


Ala 


C £3 -v- 

o isx 


T .^11 
XiC. LX 


vjrxy 


Tyr 










£ a c 

O O 3 








J; 


oc:x 


y a 


ai n 

VjXll 


Arg 


vaxy 


Leu 


Arg 








7 0 0 










Mot- 
net, 


Vaxy 


vjxn 


vax 


vjXU 


xxe 


(jXU 


Asp 






/ lo 










720 


xikXd 


X H 


Xjy o 








X xe 


Asp 




/ J U 










73 5 




irp 


Ser 


Tyr 


vrxy 


vaxy 


Fne 


Leu 


Ser 


/4 D 










750 






\j±Il 


vax 


fne 


Lys 


vax 


AX a 


xxe 


Axa 










765 








A±a 


Tyr 


Asp 


Thr 


Gly 


Tyr 


Thr 


Glu 








780 












m Ti 

^7XX1 


nx o 


vaXy 


Tyr 


V7XU 


/^xa 


wxy 






'7 Q C 










800 


Leu 


Pro 


Asn 


V7X U 


Pro 


Asn 


Arg 


Leu 




o XU 










815 




Glu 


Asn 


Val 


Xi. X o 






XIX o 


X XXX 


825 










830 






Arg 


Ala 


Gly 


Lys 


Pro 


Tyr 


Gin 


Leu 










845 








Ser 


He 


Arg 


Cys 


Pro 


Glu 


Ser 


Gly 








860 










His 


Phe 


Leu 


Gin 


Glu 


Tyr 


Leu 





875 



<210> 34 
caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60
tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120
tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180
cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240
gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300
cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360
gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420
gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480
cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540
gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600
tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660
gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720
ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780
atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840
ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900
gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960
gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020
gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080
gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140
tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200
gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260
aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320
ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380
gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440
cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500
gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560
tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620
gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680
ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740
gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800
tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860
aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920
tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980
atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040
gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100
tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160
catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220
cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280
gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340
tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400
gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460
gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520
gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggtggccatc 2580
gcgggtgccc cggtcaccgt ctggatggcc tacgacacag ggtacactga gcgctacatg 2640
gacgtccctg agaacaacca gcacggctat gaggcgggtt ccgtggccct gcacgtggag 2700
aagctgccca atgagcccaa ccgcttgctt atcctccacg gcttcctgga cgaaaacgtg 2760
cactttttcc acacaaactt cctcgtctcc caactgatcc gagcagggaa accttaccag 2820
ctccagatct accccaacga gagacacagt attcgctgcc ccgagtcggg cgagcactat 2880
gaagtcacgt tgctgcactt tctacaggaa tacctctgag cctgcccacc gggagccgcc 2940
acatcacagc acaagtggct gcagcctccg cggggaacca ggcgggaggg actgagtggc 3000
ccgcgggccc cagtgaggca ctttgtcccg cccagcgctg gccagccccg aggagccgct 3060
gccttcaccg ccccgacgcc ttttatcctt ttttaaacgc tcttgggttt tatgtccgct 3120
gcttcttggt tgccgagaca gagagatggt ggtctcgggc cagcccctcc tctccccgcc 3180
ttctgggagg aggaggtcac acgctgatgg gcactggaga ggccagaaga gactcagagg 3240
agcgggctgc cttccgcctg gggctccctg tgacctctca gtcccctggc ccggccagcc 3300
accgtcccca gcacccaagc atgcaattgc ctgtcccccc cggccagcct ccccaacttg 3360
atgtttgtgt tttgtttggg gggatatttt tcataattat ttaaaagaca ggccgggcgc 3420
ggtggctcac gtctgtaatc ccagcacttt gggaggctga ggcgggcgga tcacctgagg 3480
ttgggagttc aagaccagcc tggccaacat ggggaaaccc cgtctctact aaaaatacaa 3540
aaaattagcc gggtgtggtg gcgcgtgcct ataatcccag ctactcggga ggctgaggca 3600
ggagaatcgc ttgaacccgg gaggtggagg ttgcggtgag ccaagatcgc accattgcac 3660
tccagcctgg gcaacaagag cgaaactctg tctcaaaata aataaaaaat aaaagacaga 3720
aagcaagggg tgcctaaatc tagacttggg gtccacaccg ggcagcgggg ttgcaaccca 3780
gcacctggta ggctccattt cttcccaagc ccgagcagag ggtcatgcgg gccccacagg 3840
agaagcggcc agggcccgcg gggggcacca cctgtggaca gccctcctgt ccccaagctt 3900
tcaggcaggc actgaaacgc accgaacttc cacgctctgc tggtcagtgg cggctgtccc 3960
ctccccagcc cagccgccca gccacatgtg tctgcctgac ccgtacacac caggggttcc 4020
ggggttggga gctgaaccat ccccacctca gggttatatt tccctctccc cttccctccc 4080
cgccaagagc tctgccaggg gcgggcaaaa aaaaaagtaa aaagaaaaga aaaaaaaaaa 4140
aaagaaacaa accacctcta catattatgg aaagaaaata tttttgtcga ttcttattct 4200
tttataatta tgcgtggaag aagtagacac attaaacgat tccagttgga aacatgtcac 4260
ctg 4263

<210> 35

<211> 879

<212> PRT

<213> Homo sapiens

<400> 35

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser

1 5 10 15

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr

20 25 30

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp

35 40 45

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg

50 55 60

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys
65 70 75 80

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro

85 90 95

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu

100 105 110

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala

115 120 125

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr

130 135 140

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg
145 150 155 160

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu

165 170 175

Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg

180 185 190

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu

195 200 205

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro

210 215 220

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val
225 230 235 240

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln

245 250 255

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr

260 265 270

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys

275 280 285

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile

290 295 300

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro
305 310 315 320

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg

325 330 335

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln

340 345 350

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val

355 360 365

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala

370 375 380

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg
385 390 395 400

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile
405 410 415

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val

420 425 430

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val

435 440 445

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly

450 455 460

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe
465 470 475 480

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp

485 490 495

Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys Pro Ile

500 505 510

Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu Ala Arg

515 520 525

His Gly Ser Lys Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr

530 535 540

Val Val Ser Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro
545 550 555 560

Gly Phe Ser His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val

565 570 575

Ser His Tyr Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys

580 585 590

Leu Ser Gly Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp

595 600 605

Ala Ser Met Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro

610 615 620

Glu Ile Phe His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met
625 630 635 640

Ile Tyr Lys Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val

645 650 655

Leu Phe Val Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe

660 665 670

Lys Gly Ile Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr

675 680 685

Ala Val Val Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg

690 695 700

Phe Glu Gly Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp
705 710 715 720

Gln Val Glu Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp

725 730 735

Leu Ser Arg Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser

740 745 750

Leu Met Gly Leu Ile His Lys Pro Gln Val Phe Lys Val Ala Ile Ala

755 760 765

Gly Ala Pro Val Thr Val Trp Met Ala Tyr Asp Thr Gly Tyr Thr Glu

770 775 780

Arg Tyr Met Asp Val Pro Glu Asn Asn Gln His Gly Tyr Glu Ala Gly
785 790 795 800

Ser Val Ala Leu His Val Glu Lys Leu Pro Asn Glu Pro Asn Arg Leu

805 810 815

Leu Ile Leu His Gly Phe Leu Asp Glu Asn Val His Phe Phe His Thr

820 825 830

Asn Phe Leu Val Ser Gln Leu Ile Arg Ala Gly Lys Pro Tyr Gln Leu

835 840 845

Gln Ile Tyr Pro Asn Glu Arg His Ser Ile Arg Cys Pro Glu Ser Gly

850 855 860

Glu His Tyr Glu Val Thr Leu Leu His Phe Leu Gln Glu Tyr Leu

865 870 875

<210> 36

<211> 4180 

<212> DNA 

<213> Homo sapiens 

<400> 36 



caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60
tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120
tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180
cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240
gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300
cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360
gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420
gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480
cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540
gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600
tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660
gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720
ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780
atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840
ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900
gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960
gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020
gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080
gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140
tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200
gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260
aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320
ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380
gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440
cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500
gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560
tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620
gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680
ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740
gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800
tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860
aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920
tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980
atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040
gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100
tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160
catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220
cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280
gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340
tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400
gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460
gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520
gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggtggccatc 2580
gcgggtgccc cggtcaccgt ctggatggcc tacgacacag ggtacactga gcgctacatg 2640
gacgtccctg agaacaacca gcacggctat gaggcgggtt ccgtggccct gcacgtggag 2700
aagctgccca atgagcccaa ccgcttgctt atcctccacg gcttcctgga cgaaaacgtg 2760
cactttttcc acacaaactt cctcgtctcc caactgatcc gagcagggaa accttaccag 2820
ctccagatct accccaacga gagacacagt attcgctgcc ccgagtcggg cgagcactat 2880
gaagtcacgt tgctgcactt tctacaggaa tacctctgag cctgcccacc gggagccgcc 2940
acatcacagc acaagtggct gcagcctccg cggggaacca ggcgggaggg actgagtggc 3000
ccgcgggccc cagtgaggca ctttgtcccg cccagcgctg gccagccccg aggagccgct 3060
gccttcaccg ccccgacgcc ttttatcctt ttttaaacgc tcttgggttt tatgtccgct 3120
gcttcttggt tgccgagaca gagagatggt ggtctcgggc cagcccctcc tctccccgcc 3180
ttctgggagg aggaggtcac acgctgatgg gcactggaga ggccagaaga gactcagagg 3240

agcgggctgc cttccgcctg gggctccctg tgacctctca gtcccctggc ccggccagcc 3300

accgtcccca gcacccaagc atgcaattgc ctgtcccccc cggccagcct ccccaacttg 3360

atgtttgtgt tttgtttggg gggatatttt tcataattat ttaaaagaca ggccgggcgc 3420

ggtggctcac gtctgtaatc ccagcacttt gggaggctga ggcgggcgga tcacctgagg 3480

ttgggagttc aagaccagcc tggccaacat ggggaaaccc cgtctctact aaaaatacaa 3540

aaaattagcc gggtgtggtg gcgcgtgcct ataatcccag ctactcggga ggctgaggca 3600

ggagaatcgc ttgaacccgg gaggtggagg ttgcggtgag ccaagatcgc accattgcac 3660

tccagcctgg gcaacaagag cgaaactctg tctcaaaata aataaaaaat aaaagacaga 3720

aagcaagggg tgcctaaatc tagacttggg gtccacaccg ggcagcgggg ttgcaaccca 3780

gcacctggta ggctccattt cttcccaagc ccgactttca ggcaggcact gaaacgcacc 3840

gaacttccac gctctgctgg tcagtggcgg ctgtcccctc cccagcccag ccgcccagcc 3900

acatgtgtct gcctgacccg tacacaccag gggttccggg gttgggagct gaaccatccc 3960

cacctcaggg ttatatttcc ctctcccctt ccctccccgc caagagctct gccaggggcg 4020

ggcaaaaaaa aaagtaaaaa gaaaagaaaa aaaaaaaaaa gaaacaaacc acctctacat 4080

attatggaaa gaaaatattt ttgtcgattc ttattctttt ataattatgc gtggaagaag 4140

tagacacatt aaacgattcc agttggaaac atgtcacctg 4180

<210> 37 

<211> 819 

<212> PRT 

<213> Homo sapiens 

<400> 37 

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser

1 5 10 15

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr

20 25 30

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp

35 40 45

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg

50 55 60

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys
65 70 75 80

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro

85 90 95

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu

100 105 110

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala

115 120 125

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr

130 135 140

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg
145 150 155 160

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu

165 170 175

Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg

180 185 190

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu

195 200 205

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro

210 215 220

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val
225 230 235 240

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln

245 250 255

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr
260 265 270

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys

275 280 285

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile

290 295 300

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro
305 310 315 320

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg

325 330 335

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln

340 345 350

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val

355 360 365

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala

370 375 380

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg
385 390 395 400

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile

405 410 415

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val

420 425 430

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val

435 440 445

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly

450 455 460

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe
465 470 475 480

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp

485 490 495

Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys Pro Ile

500 505 510

Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu Ala Arg

515 520 525

His Gly Ser Lys Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr

530 535 540

Val Val Ser Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro
545 550 555 560

Gly Phe Ser His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val

565 570 575

Ser His Tyr Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys

580 585 590

Leu Ser Gly Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp

595 600 605

Ala Ser Met Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro

610 615 620

Glu Ile Phe His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met
625 630 635 640

Ile Tyr Lys Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val

645 650 655

Leu Phe Val Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe

660 665 670

Lys Gly Ile Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr

675 680 685

Ala Val Val Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg

690 695 700

Phe Glu Gly Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp
705 710 715 720

Gln Val Glu Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp

725 730 735

Leu Ser Arg Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser

740 745 750

Leu Met Gly Leu Ile His Lys Pro Gln Val Phe Lys Ala Gln Pro Leu

755 760 765

Ala Tyr Pro Pro Arg Leu Pro Gly Arg Lys Arg Ala Leu Phe Pro His

770 775 780

Lys Leu Pro Arg Leu Pro Thr Asp Pro Ser Arg Glu Thr Leu Pro Ala
785 790 795 800

Pro Asp Leu Pro Gln Arg Glu Thr Gln Tyr Ser Leu Pro Arg Val Gly
805 810 815

Arg Ala Leu

<210> 38 

<211> 4120 

<212> DNA 

<213> Homo sapiens 

<400> 38 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60
tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120
tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180
cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240
gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300
cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360
gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420
gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480
cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540
gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600
tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660
gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720
ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780
atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840
ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900
gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960
gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020
gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080
gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140
tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200
gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260
aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320
ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380
gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440
cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500
gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560
tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620
gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680
ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740
gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800
tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860
aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920
tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980
atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040
gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100
tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160
catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220
cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280
gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340
tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400
gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460
gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520
gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggcccaaccg 2580
cttgcttatc ctccacggct tcctggacga aaacgtgcac tttttccaca caaacttcct 2640
cgtctcccaa ctgatccgag cagggaaacc ttaccagctc cagatctacc ccaacgagag 2700
acacagtatt cgctgccccg agtcgggcga gcactatgaa gtcacgttgc tgcactttct 2760
acaggaatac ctctgagcct gcccaccggg agccgccaca tcacagcaca agtggctgca 2820
gcctccgcgg ggaaccaggc gggagggact gagtggcccg cgggccccag tgaggcactt 2880
tgtcccgccc agcgctggcc agccccgagg agccgctgcc ttcaccgccc cgacgccttt 2940
tatccttttt taaacgctct tgggttttat gtccgctgct tcttggttgc cgagacagag 3000
agatggtggt ctcgggccag cccctcctct ccccgccttc tgggaggagg aggtcacacg 3060
ctgatgggca ctggagaggc cagaagagac tcagaggagc gggctgcctt ccgcctgggg 3120
ctccctgtga cctctcagtc ccctggcccg gccagccacc gtccccagca cccaagcatg 3180
caattgcctg tcccccccgg ccagcctccc caacttgatg tttgtgtttt gtttgggggg 3240
atatttttca taattattta aaagacaggc cgggcgcggt ggctcacgtc tgtaatccca 3300
gcactttggg aggctgaggc gggcggatca cctgaggttg ggagttcaag accagcctgg 3360
ccaacatggg gaaaccccgt ctctactaaa aatacaaaaa attagccggg tgtggtggcg 3420
cgtgcctata atcccagcta ctcgggaggc tgaggcagga gaatcgcttg aacccgggag 3480
gtggaggttg cggtgagcca agatcgcacc attgcactcc agcctgggca acaagagcga 3540
aactctgtct caaaataaat aaaaaataaa agacagaaag caaggggtgc ctaaatctag 3600
acttggggtc cacaccgggc agcggggttg caacccagca cctggtaggc tccatttctt 3660
cccaagcccg agcagagggt catgcgggcc ccacaggaga agcggccagg gcccgcgggg 3720
ggcaccacct gtggacagcc ctcctgtccc caagctttca ggcaggcact gaaacgcacc 3780
gaacttccac gctctgctgg tcagtggcgg ctgtcccctc cccagcccag ccgcccagcc 3840
acatgtgtct gcctgacccg tacacaccag gggttccggg gttgggagct gaaccatccc 3900
cacctcaggg ttatatttcc ctctcccctt ccctccccgc caagagctct gccaggggcg 3960
ggcaaaaaaa aaagtaaaaa gaaaagaaaa aaaaaaaaaa gaaacaaacc acctctacat 4020
attatggaaa gaaaatattt ttgtcgattc ttattctttt ataattatgc gtggaagaag 4080
tagacacatt aaacgattcc agttggaaac atgtcacctg 4120

<210> 39

<211> 819

<212> PRT

<213> Homo sapiens

<400> 39

Met Arg Lys Val Lys Lys Leu Arg Leu Asp Lys Glu Asn Thr Gly Ser

1 5 10 15

Trp Arg Ser Phe Ser Leu Asn Ser Glu Gly Ala Glu Arg Met Ala Thr

20 25 30

Thr Gly Thr Pro Thr Ala Asp Arg Gly Asp Ala Ala Ala Thr Asp Asp

35 40 45

Pro Ala Ala Arg Phe Gln Val Gln Lys His Ser Trp Asp Gly Leu Arg

50 55 60

Ser Ile Ile His Gly Ser Arg Lys Tyr Ser Gly Leu Ile Val Asn Lys
65 70 75 80

Ala Pro His Asp Phe Gln Phe Val Gln Lys Thr Asp Glu Ser Gly Pro

85 90 95

His Ser His Arg Leu Tyr Tyr Leu Gly Met Pro Tyr Gly Ser Arg Glu

100 105 110

Asn Ser Leu Leu Tyr Ser Glu Ile Pro Lys Lys Val Arg Lys Glu Ala

115 120 125

Leu Leu Leu Leu Ser Trp Lys Gln Met Leu Asp His Phe Gln Ala Thr

130 135 140

Pro His His Gly Val Tyr Ser Arg Glu Glu Glu Leu Leu Arg Glu Arg
145 150 155 160

Lys Arg Leu Gly Val Phe Gly Ile Thr Ser Tyr Asp Phe His Ser Glu

165 170 175

Ser Gly Leu Phe Leu Phe Gln Ala Ser Asn Ser Leu Phe His Cys Arg

180 185 190

Asp Gly Gly Lys Asn Gly Phe Met Val Ser Pro Met Lys Pro Leu Glu
195 200 205

Ile Lys Thr Gln Cys Ser Gly Pro Arg Met Asp Pro Lys Ile Cys Pro
210 215 220

Ala Asp Pro Ala Phe Phe Ser Phe Ile Asn Asn Ser Asp Leu Trp Val
225 230 235 240

Ala Asn Ile Glu Thr Gly Glu Glu Arg Arg Leu Thr Phe Cys His Gln
245 250 255

Gly Leu Ser Asn Val Leu Asp Asp Pro Lys Ser Ala Gly Val Ala Thr
260 265 270

Phe Val Ile Gln Glu Glu Phe Asp Arg Phe Thr Gly Tyr Trp Trp Cys
275 280 285

Pro Thr Ala Ser Trp Glu Gly Ser Glu Gly Leu Lys Thr Leu Arg Ile
290 295 300

Leu Tyr Glu Glu Val Asp Glu Ser Glu Val Glu Val Ile His Val Pro
305 310 315 320

Ser Pro Ala Leu Glu Glu Arg Lys Thr Asp Ser Tyr Arg Tyr Pro Arg
325 330 335

Thr Gly Ser Lys Asn Pro Lys Ile Ala Leu Lys Leu Ala Glu Phe Gln
340 345 350

Thr Asp Ser Gln Gly Lys Ile Val Ser Thr Gln Glu Lys Glu Leu Val
355 360 365

Gln Pro Phe Ser Ser Leu Phe Pro Lys Val Glu Tyr Ile Ala Arg Ala
370 375 380

Gly Trp Thr Arg Asp Gly Lys Tyr Ala Trp Ala Met Phe Leu Asp Arg
385 390 395 400

Pro Gln Gln Trp Leu Gln Leu Val Leu Leu Pro Pro Ala Leu Phe Ile
405 410 415

Pro Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val
420 425 430

Pro Arg Asn Val Gln Pro Tyr Val Val Tyr Glu Glu Val Thr Asn Val
435 440 445

Trp Ile Asn Val His Asp Ile Phe Tyr Pro Phe Pro Gln Ser Glu Gly
450 455 460

Glu Asp Glu Leu Cys Phe Leu Arg Ala Asn Glu Cys Lys Thr Gly Phe
465 470 475 480

Cys His Leu Tyr Lys Val Thr Ala Val Leu Lys Ser Gln Gly Tyr Asp
485 490 495

Trp Ser Glu Pro Phe Ser Pro Gly Glu Asp Glu Phe Lys Cys Pro Ile
500 505 510

Lys Glu Glu Ile Ala Leu Thr Ser Gly Glu Trp Glu Val Leu Ala Arg
515 520 525

His Gly Ser Lys Gly Thr Lys Asp Thr Pro Leu Glu His His Leu Tyr
530 535 540

Val Val Ser Tyr Glu Ala Ala Gly Glu Ile Val Arg Leu Thr Thr Pro
545 550 555 560

Gly Phe Ser His Ser Cys Ser Met Ser Gln Asn Phe Asp Met Phe Val
565 570 575

Ser His Tyr Ser Ser Val Ser Thr Pro Pro Cys Val His Val Tyr Lys
580 585 590

Leu Ser Gly Pro Asp Asp Asp Pro Leu His Lys Gln Pro Arg Phe Trp
595 600 605

Ala Ser Met Met Glu Ala Ala Ser Cys Pro Pro Asp Tyr Val Pro Pro
610 615 620

Glu Ile Phe His Phe His Thr Arg Ser Asp Val Arg Leu Tyr Gly Met
625 630 635 640

Ile Tyr Lys Pro His Ala Leu Gln Pro Gly Lys Lys His Pro Thr Val
645 650 655

Leu Phe Val Tyr Gly Gly Pro Gln Val Gln Leu Val Asn Asn Ser Phe
660 665 670

Lys Gly Ile Lys Tyr Leu Arg Leu Asn Thr Leu Ala Ser Leu Gly Tyr
675 680 685

Ala Val Val Val Ile Asp Gly Arg Gly Ser Cys Gln Arg Gly Leu Arg
690 695 700

Phe Glu Gly Ala Leu Lys Asn Gln Met Gly Gln Val Glu Ile Glu Asp
705 710 715 720

Gln Val Glu Gly Leu Gln Phe Val Ala Glu Lys Tyr Gly Phe Ile Asp
725 730 735

Leu Ser Arg Val Ala Ile His Gly Trp Ser Tyr Gly Gly Phe Leu Ser
740 745 750

Leu Met Gly Leu Ile His Lys Pro Gln Val Phe Lys Ala Gln Pro Leu
755 760 765

Ala Tyr Pro Pro Arg Leu Pro Gly Arg Lys Arg Ala Leu Phe Pro His
770 775 780

Lys Leu Pro Arg Leu Pro Thr Asp Pro Ser Arg Glu Thr Leu Pro Ala
785 790 795 800

Pro Asp Leu Pro Gln Arg Glu Thr Gln Tyr Ser Leu Pro Arg Val Gly
805 810 815

Arg Ala Leu

<210> 40 

<211> 4037 

<212> DNA 

<213> Homo sapiens 

<400> 40 

caggccgccg cctgggtcgc tcaacttccg ggtcaaaggt gcctgagccg gcgggtcccc 60 

tgtgtccgcc gcggctgtcg tcccccgctc ccgccacttc cggggtcgca gtcccgggca 120

tggagccgcg accgtgaggc gccgctggac ccgggacgac ctgcccagtc cggccgccgc 180

cccacgtccc ggtctgtgtc ccacgcctgc agctggaatg gaggctctct ggacccttta 240 

gaaggcaccc ctgccctcct gaggtcagct gagcggttaa tgcggaaggt taagaaactg 300

cgcctggaca aggagaacac cggaagttgg agaagcttct cgctgaattc cgagggggct 360

gagaggatgg ccaccaccgg gaccccaacg gccgaccgag gcgacgcagc cgccacagat 420 

gacccggccg cccgcttcca ggtgcagaag cactcgtggg acgggctccg gagcatcatc 480 

cacggcagcc gcaagtactc gggcctcatt gtcaacaagg cgccccacga cttccagttt 540 

gtgcagaaga cggatgagtc tgggccccac tcccaccgcc tctactacct gggaatgcca 600

tatggcagcc gagagaactc cctcctctac tctgagattc ccaagaaggt ccggaaagag 660 

gctctgctgc tcctgtcctg gaagcagatg ctggatcatt tccaggccac gccccaccat 720 

ggggtctact ctcgggagga ggagctgctg agggagcgga aacgcctggg ggtcttcggc 780

atcacctcct acgacttcca cagcgagagt ggcctcttcc tcttccaggc cagcaacagc 840 

ctcttccact gccgcgacgg cggcaagaac ggcttcatgg tgtcccctat gaaaccgctg 900 

gaaatcaaga cccagtgctc agggccccgg atggacccca aaatctgccc tgccgaccct 960

gccttcttct ccttcatcaa taacagcgac ctgtgggtgg ccaacatcga gacaggcgag 1020

gagcggcggc tgaccttctg ccaccaaggt ttatccaatg tcctggatga ccccaagtct 1080

gcgggtgtgg ccaccttcgt catacaggaa gagttcgacc gcttcactgg gtactggtgg 1140

tgccccacag cctcctggga aggttcagag ggcctcaaga cgctgcgaat cctgtatgag 1200

gaagtcgatg agtccgaggt ggaggtcatt cacgtcccct ctcctgcgct agaagaaagg 1260

aagacggact cgtatcggta ccccaggaca ggcagcaaga atcccaagat tgccttgaaa 1320 

ctggctgagt tccagactga cagccagggc aagatcgtct cgacccagga gaaggagctg 1380

gtgcagccct tcagctcgct gttcccgaag gtggagtaca tcgccagggc cgggtggacc 1440

cgggatggca aatacgcctg ggccatgttc ctggaccggc cccagcagtg gctccagctc 1500 

gtcctcctcc ccccggccct gttcatcccg agcacagaga atgaggagca gcggctagcc 1560

tctgccagag ctgtccccag gaatgtccag ccgtatgtgg tgtacgagga ggtcaccaac 1620

gtctggatca atgttcatga catcttctat cccttccccc aatcagaggg agaggacgag 1680 

ctctgctttc tccgcgccaa tgaatgcaag accggcttct gccatttgta caaagtcacc 1740

gccgttttaa aatcccaggg ctacgattgg agtgagccct tcagccccgg ggaagatgaa 1800

tttaagtgcc ccattaagga agagattgct ctgaccagcg gtgaatggga ggttttggcg 1860 



aggcacggct ccaagggcac caaggacacg ccgctggagc accacctcta cgtggtcagc 1920

tatgaggcgg ccggcgagat cgtacgcctc accacgcccg gcttctccca tagctgctcc 1980 

atgagccaga acttcgacat gttcgtcagc cactacagca gcgtgagcac gccgccctgc 2040 

gtgcacgtct acaagctgag cggccccgac gacgaccccc tgcacaagca gccccgcttc 2100 

tgggctagca tgatggaggc agccagctgc cccccggatt atgttcctcc agagatcttc 2160 

catttccaca cgcgctcgga tgtgcggctc tacggcatga tctacaagcc ccacgccttg 2220 

cagccaggga agaagcaccc caccgtcctc tttgtatatg gaggccccca ggtgcagctg 2280 

gtgaataact ccttcaaagg catcaagtac ttgcggctca acacactggc ctccctgggc 2340 

tacgccgtgg ttgtgattga cggcaggggc tcctgtcagc gagggcttcg gttcgaaggg 2400 

gccctgaaaa accaaatggg ccaggtggag atcgaggacc aggtggaggg cctgcagttc 2460 

gtggccgaga agtatggctt catcgacctg agccgagttg ccatccatgg ctggtcctac 2520 

gggggcttcc tctcgctcat ggggctaatc cacaagcccc aggtgttcaa ggcccaaccg 2580 

cttgcttatc ctccacggct tcctggacga aaacgtgcac tttttccaca caaacttcct 2640 

cgtctcccaa ctgatccgag cagggaaacc ttaccagctc cagatctacc ccaacgagag 2700 

acacagtatt cgctgccccg agtcgggcga gcactatgaa gtcacgttgc tgcactttct 2760 

acaggaatac ctctgagcct gcccaccggg agccgccaca tcacagcaca agtggctgca 2820 

gcctccgcgg ggaaccaggc gggagggact gagtggcccg cgggccccag tgaggcactt 2880 

tgtcccgccc agcgctggcc agccccgagg agccgctgcc ttcaccgccc cgacgccttt 2940 

tatccttttt taaacgctct tgggttttat gtccgctgct tcttggttgc cgagacagag 3000 

agatggtggt ctcgggccag cccctcctct ccccgccttc tgggaggagg aggtcacacg 3060 

ctgatgggca ctggagaggc cagaagagac tcagaggagc gggctgcctt ccgcctgggg 3120

ctccctgtga cctctcagtc ccctggcccg gccagccacc gtccccagca cccaagcatg 3180 

caattgcctg tcccccccgg ccagcctccc caacttgatg tttgtgtttt gtttgggggg 3240 

atatttttca taattattta aaagacaggc cgggcgcggt ggctcacgtc tgtaatccca 3300

gcactttggg aggctgaggc gggcggatca cctgaggttg ggagttcaag accagcctgg 3360

ccaacatggg gaaaccccgt ctctactaaa aatacaaaaa attagccggg tgtggtggcg 3420 

cgtgcctata atcccagcta ctcgggaggc tgaggcagga gaatcgcttg aacccgggag 3480

gtggaggttg cggtgagcca agatcgcacc attgcactcc agcctgggca acaagagcga 3540 

aactctgtct caaaataaat aaaaaataaa agacagaaag caaggggtgc ctaaatctag 3600 

acttggggtc cacaccgggc agcggggttg caacccagca cctggtaggc tccatttctt 3660 

cccaagcccg actttcaggc aggcactgaa acgcaccgaa cttccacgct ctgctggtca 3720

gtggcggctg tcccctcccc agcccagccg cccagccaca tgtgtctgcc tgacccgtac 3780 

acaccagggg ttccggggtt gggagctgaa ccatccccac ctcagggtta tatttccctc 3840

tccccttccc tccccgccaa gagctctgcc aggggcgggc aaaaaaaaaa gtaaaaagaa 3900

aagaaaaaaa aaaaaaagaa acaaaccacc tctacatatt atggaaagaa aatatttttg 3960

tcgattctta ttcttttata attatgcgtg gaagaagtag acacattaaa cgattccagt 4020 

tggaaacatg tcacctg 4037 

<210> 41 

<211> 706 

<212> PRT 

<213> Homo sapiens 

<400> 41 

Asp Thr Asp Val Val Tyr Lys Ser Glu Asn Gly His Val Ile Lys Leu
1 5 10 15

Asn Ile Glu Thr Asn Ala Thr Thr Leu Leu Leu Glu Asn Thr Thr Phe
20 25 30

Val Thr Phe Lys Ala Ser Arg His Ser Val Ser Pro Asp Leu Lys Tyr
35 40 45

Val Leu Leu Ala Tyr Asp Val Lys Gln Ile Phe His Tyr Ser Tyr Thr
50 55 60

Ala Ser Tyr Val Ile Tyr Asn Ile His Thr Arg Glu Val Trp Glu Leu
65 70 75 80

Asn Pro Pro Glu Val Glu Asp Ser Val Leu Gln Tyr Ala Ala Trp Gly
85 90 95

Val Gln Gly Gln Gln Leu Ile Tyr Ile Phe Glu Asn Asn Ile Tyr Tyr
100 105 110

Gln Pro Asp Ile Lys Ser Ser Ser Leu Arg Leu Thr Ser Ser Gly Lys
115 120 125

Glu Glu Ile Ile Phe Asn Gly Ile Ala Asp Trp Leu Tyr Glu Glu Glu
130 135 140

Leu Leu His Ser His Ile Ala His Trp Trp Ser Pro Asp Gly Glu Arg
145 150 155 160

Leu Ala Phe Leu Met Ile Asn Asp Ser Leu Val Pro Thr Met Val Ile
165 170 175

Pro Arg Phe Thr Gly Ala Leu Tyr Pro Lys Gly Lys Gln Tyr Pro Tyr
180 185 190

Pro Lys Ala Gly Gln Val Asn Pro Thr Ile Lys Leu Tyr Val Val Asn
195 200 205

Leu Tyr Gly Pro Thr His Thr Leu Glu Leu Met Pro Pro Asp Ser Phe
210 215 220

Lys Ser Arg Glu Tyr Tyr Ile Thr Met Val Lys Trp Val Ser Asn Thr
225 230 235 240

Lys Thr Val Val Arg Trp Leu Asn Arg Pro Gln Asn Ile Ser Ile Leu
245 250 255

Thr Val Cys Glu Thr Thr Thr Gly Ala Cys Ser Lys Lys Tyr Glu Met
260 265 270

Thr Ser Asp Thr Trp Leu Ser Gln Gln Asn Glu Glu Pro Val Phe Ser
275 280 285

Arg Asp Gly Ser Lys Phe Phe Met Thr Val Pro Val Lys Gln Gly Gly
290 295 300

Arg Gly Glu Phe His His Ile Ala Met Phe Leu Ile Gln Ser Lys Ser
305 310 315 320

Glu Gln Ile Thr Val Arg His Leu Thr Ser Gly Asn Trp Glu Val Ile
325 330 335

Lys Ile Leu Ala Tyr Asp Glu Thr Thr Gln Lys Ile Tyr Phe Leu Ser
340 345 350

Thr Glu Ser Ser Pro Arg Gly Arg Gln Leu Tyr Ser Ala Ser Thr Glu
355 360 365

Gly Leu Leu Asn Arg Gln Cys Ile Ser Cys Asn Phe Met Lys Glu Gln
370 375 380

Cys Thr Tyr Phe Asp Ala Ser Phe Ser Pro Met Asn Gln His Phe Leu
385 390 395 400

Leu Phe Cys Glu Gly Pro Arg Val Pro Val Val Ser Leu His Ser Thr
405 410 415

Asp Asn Pro Ala Lys Tyr Phe Ile Leu Glu Ser Asn Ser Met Leu Lys
420 425 430

Glu Ala Ile Leu Lys Lys Lys Ile Gly Lys Pro Glu Ile Lys Ile Leu
435 440 445

His Ile Asp Asp Tyr Glu Leu Pro Leu Gln Leu Ser Leu Pro Lys Asp
450 455 460

Phe Met Asp Arg Asn Gln Tyr Ala Leu Leu Leu Ile Met Asp Glu Glu
465 470 475 480

Pro Gly Gly Gln Leu Val Thr Asp Lys Phe His Ile Asp Trp Asp Ser
485 490 495

Val Leu Ile Asp Met Asp Asn Val Ile Val Ala Arg Phe Asp Gly Arg
500 505 510

Gly Ser Gly Phe Gln Gly Leu Lys Ile Leu Gln Glu Ile His Arg Arg
515 520 525

Leu Gly Ser Val Glu Val Lys Asp Gln Ile Thr Ala Val Lys Phe Leu
530 535 540

Leu Lys Leu Pro Tyr Ile Asp Ser Lys Arg Leu Ser Ile Phe Gly Lys
545 550 555 560

Gly Tyr Gly Gly Tyr Ile Ala Ser Met Ile Leu Lys Ser Asp Glu Lys
565 570 575



WO 02/31134

PCT/US01/31874



Leu Phe Lys Cys Gly Ser Val Val Ala Pro Ile Thr Asp Leu Lys Leu
580 585 590

Tyr Ala Ser Ala Phe Ser Glu Arg Tyr Leu Gly Met Pro Ser Lys Glu
595 600 605

Glu Ser Thr Tyr Gln Ala Ala Ser Val Leu His Asn Val His Gly Leu
610 615 620

Lys Glu Glu Asn Ile Leu Ile Ile His Gly Thr Ala Asp Thr Lys Val
625 630 635 640

His Phe Gln His Ser Ala Glu Leu Ile Lys His Leu Ile Lys Ala Gly
645 650 655

Val Asn Tyr Thr Met Gln Val Tyr Pro Asp Glu Gly His Asn Val Ser
660 665 670

Glu Lys Ser Lys Tyr His Leu Tyr Ser Thr Ile Leu Lys Phe Phe Ser
675 680 685

Asp Cys Leu Lys Glu Glu Ile Ser Val Leu Pro Gln Glu Pro Glu Glu
690 695 700

Asp Glu
705



<210> 42 

<211> 4541 

<212> DNA 

<213> Homo sapiens 

<400> 42 



gkctykgtkg wtsmagatac agatgtggtg tataaaagcg agaatggaca tgtcattaaa 60

ctgaatatag aaacaaatgc taccacatta ttattggaaa acacaacttt tgtaaccttc 120

aaagcatcaa gacattcagt ttcaccagat ttaaaatatg tccttctggc atatgatgtc 180

aaacagattt ttcattattc gtatactgct tcatatgtga tttacaacat acacactagg 240

gaagtttggg agttaaatcc tccagaagta gaggactccg tcttgcagta cgcggcctgg 300

ggtgtccaag ggcagcagct gatttatatt tttgaaaata atatctacta tcaacctgat 360

ataaagagca gttcattgcg actgacatct tctggaaaag aagaaataat ttttaatggg 420

attgctgact ggttatatga agaggaactc ctgcattctc acatcgccca ctggtggtca 480

ccagatggag aaagacttgc cttcctgatg ataaatgact ctttggtacc caccatggtt 540

atccctcggt ttactggagc gttgtatccc aaaggaaagc agtatccgta tcctaaggca 600

ggtcaagtga acccaacaat aaaattatat gttgtaaacc tgtatggacc aactcacact 660

ttggagctca tgccacctga cagctttaaa tcaagagaat actatatcac tatggttaaa 720

tgggtaagca ataccaagac tgtggtaaga tggttaaacc gacctcagaa catctccatc 780

ctcacagtct gtgagaccac tacaggtgct tgtagtaaaa aatatgagat gacatcagat 840

acgtggctct ctcagcagaa tgaggagccc gtgttttcta gagacggcag caaattcttt 900

atgacagtgc ctgttaagca agggggacgt ggagaatttc accacatagc tatgttcctc 960

atccagagta aaagtgagca aattaccgtg cggcatctga catcaggaaa ctgggaagtg 1020

ataaagatct tggcatacga tgaaactact caaaaaattt actttctgag cactgaatct 1080

tctcccagag gaaggcagct gtacagtgct tctactgaag gattattgaa tcgccaatgc 1140

atttcatgta atttcatgaa agaacaatgt acatattttg atgccagttt tagtcccatg 1200

aatcaacatt tcttattatt ctgtgaaggt ccaagggtcc cagtggtcag cctacatagt 1260

acggacaacc cagcaaaata ttttatattg gaaagcaatt ctatgctgaa ggaagctatc 1320

ctgaagaaga agataggaaa gccagaaatt aaaatccttc atattgacga ctatgaactt 1380

cctttacagt tgtcccttcc caaagatttt atggaccgaa accagtatgc tcttctgtta 1440

ataatggatg aagaaccagg aggccagctg gttacagata agttccatat tgactgggat 1500

tccgtactca ttgacatgga taatgtcatt gtagcaagat ttgatggcag aggaagtgga 1560

ttccagggtc tgaaaatttt gcaggagatt catcgaagat taggttcagt agaagtaaag 1620

gaccaaataa cagctgtgaa atttttgctg aaactgcctt acattgactc caaaagatta 1680

agcatttttg gaaagggtta tggtggctat attgcatcaa tgatcttaaa atcagatgaa 1740

aagcttttta aatgtggatc cgtggttgca cctatcacag acttgaaatt gtatgcctca 1800

gctttctctg aaagatacct tgggatgcca tctaaggaag aaagcactta ccaggcagcc 1860

agtgtgctac ataatgttca tggcttgaaa gaagaaaata tattaataat tcatggaact 1920

gctgacacaa aagttcattt ccaacactca gcagaattaa tcaagcacct aataaaagct 1980

ggagtgaatt atactatgca ggtctaccca gatgaaggtc ataacgtatc tgagaagagc 2040
aagtatcatc tctacagcac aatcctcaaa ttcttcagtg attgtttgaa ggaagaaata 2100

tctgtgctac cacaggaacc agaagaagat gaataatgga ccgtatttat acagaactga 2160 

agggaatatt gaggctcaat gaaacctgac aaagagactg taatattgta gttgctccag 2220 

aatgtcaagg gcagcttacg gagatgtcac tggagcagca cgctcagaga cagtgaacta 2280

gcatttgaat acacaagtcc aagtctactg tgttgctagg ggtgcagaac ccgtttcttt 2340 

gtatgagaga ggtcaaaggg ttggtttcct gggagaaatt agttttgcat taaagtagga 2400

gtagtgcatg ttttcttctg ttatccccct gtttgttctg taactagttg ctctcatttt 2460

aatttcactg gccaccatca tctttgcata taatgcacaa tctatcatct gtcctacagt 2520 

ccctgatctt tcatggctga gctgcaatct aacactttac tgtaccttta taataagtgc 2580 

aattctttca ttgtctatta ttatgcttaa gaaaatattc agttaataaa aaacagagta 2640 

ttttatgtaa tttctgtttt taaaaagaca ttattaaatg ggtcaaagga catatagaaa 2700

tgtggatttc agcaccttcc aaagttcagc cagttatcag tagatacaat atctttaaat 2760 

gaacacacga gtgtatgtct cacaatatat atacacaagt gtgcatatac agttaatgaa 2820 

actatcttta aatgttattc atgctataaa gagtaaacgt ttgatgaatt agaagagatg 2880

ctcttttcca agctataatg gatgctttgt ttaatgagcc aaatatgatg aaacattttt 2940 

tccaattcaa attctagcta ttgctttcct ataaatgttt gggttgtgtt tggtattgtt 3000 

tttagtggtt aatagttttc cagttgcatt taattttttg aatatgatac cttgtcacat 3060

gtaaattaga tacttaaata ttaaattata gtttctgata aagaaatttt gttaacaatg 3120 

caatgccact gagtgctatt ttgctctttt ggtggagaag gcttttttca aaactcttgg 3180 

tccttttact tctttctctc agtgcagaat caattctcat tttcatcgta aaagcaaata 3240 

gctggattat ttcatttgcc agtttctatt tagtattcca tgcctgccca attcatctgt 3300

tactgtttaa tttcaattct tctggtgaga attagaaatg aaatattttt tattcattgg 3360 

ccaaaaagtt cacagacagc agtgtttgct atttactttg aattgaaggc acaaaatgca 3420 

tcaattcctg tgctgtgttg acttgcagta gtaagtaact gagagcataa aataaacctg 3480 

actgtatgaa gtcaatttaa gtgatgagaa catttaactt tggtgactaa agtcagaata 3540 

tcttctcact tcacttaagg gatcttccag aagatatcta aaagtctgta ataagcttag 3600 

aagttcagat aaatctaggc aggatactgc atttttgtgg ttttaaaaaa gtccttagga 3660 

cagactgaat tatcataact tatggcatca ggaggaaact ttaaaatatc aaggaatcac 3720 

tcagtcaccc tcctgttttg ttgaaggatc aaccccaaat tctgggtatt tgagtacatg 3780 

tgaatcatgg atttggtatt caactttttc cctggatgct ttggaatcgt gtcttccatg 3840 

ctccactggg ttcaatttaa aataggagag gctttctctt ctgaaagatc cattttaggt 3900

ctttttcaag aatagtgaac acatttttta acaaaataag ttgtaatttt aaaaggaaag 3960 

ttttgcctat tttattaaga tggaaatttc tttttaggct aatttgaaat ccaactgaag 4020 

ctttttaacc aatattttaa atttgaacca ctagagtttt ttatgatgca aatgattatg 4080 

ttgtctgaaa ggtgtggttt tattgaatgt ctatttgagt atcatttaaa aagtatttgc 4140 

cttttactgt catcatttct cttgttttat tattattatc aatgtttatc tatttttcaa 4200 

ttaatttaat acagtttcta atgtgaaaga catttttctg gaacccgttt tccccttaaa 4260 

cactaaagag acctcaagtg aaagcatatt gcttagtagg aaggtagaaa atgttaatcc 4320 

ctgcgattct ttgagtttta atgacagggt cattttcagt aaaggaaatg ctcaccaaca 4380 

catagtcacc aactattaaa ggaatcatgt gattggattt tcccctgtat acatgtaccc 4440 

ttggtcataa tcccactatt tcatacatat ttatgcattg ctagattttc ctaggactcc 4500 

aatagcatgc tttccaagtg ttattattcc cttaatgtta a 4541 

<210> 43 

<211> 691 

<212> PRT 

<213> Homo sapiens 

<400> 43 

Asp Thr Asp Val Val Tyr Lys Ser Glu Asn Gly His Val Ile Lys Leu
1 5 10 15

Asn Ile Glu Thr Asn Ala Thr Thr Leu Leu Leu Glu Asn Thr Thr Phe
20 25 30

Val Thr Phe Lys Ala Ser Arg His Ser Val Ser Pro Asp Leu Lys Tyr
35 40 45

Val Leu Leu Ala Tyr Asp Val Lys Gln Ile Phe His Tyr Ser Tyr Thr
50 55 60

Ala Ser Tyr Val Ile Tyr Asn Ile His Thr Arg Glu Val Trp Glu Leu
65 70 75 80

Asn Pro Pro Glu Val Glu Asp Ser Val Leu Gln Tyr Ala Ala Trp Gly
85 90 95

Val Gln Gly Gln Gln Leu Ile Tyr Ile Phe Glu Asn Asn Ile Tyr Tyr
100 105 110

Gln Pro Asp Ile Lys Ser Ser Ser Leu Arg Leu Thr Ser Ser Gly Lys
115 120 125

Glu Glu Ile Ile Phe Asn Gly Ile Ala Asp Trp Leu Tyr Glu Glu Glu
130 135 140

Leu Leu His Ser His Ile Ala His Trp Trp Ser Pro Asp Gly Glu Arg
145 150 155 160

Leu Ala Phe Leu Met Ile Asn Asp Ser Leu Val Pro Thr Met Val Ile
165 170 175

Pro Arg Phe Thr Gly Ala Leu Tyr Pro Lys Gly Lys Gln Tyr Pro Tyr
180 185 190

Pro Lys Ala Gly Gln Val Asn Pro Thr Ile Lys Leu Tyr Val Val Asn
195 200 205

Leu Tyr Gly Pro Thr His Thr Leu Glu Leu Met Pro Pro Asp Ser Phe
210 215 220

Lys Ser Arg Glu Tyr Tyr Ile Thr Met Val Lys Trp Val Ser Asn Thr
225 230 235 240

Lys Thr Val Val Arg Trp Leu Asn Arg Pro Gln Asn Ile Ser Ile Leu
245 250 255

Thr Val Cys Glu Thr Thr Thr Gly Ala Cys Ser Lys Lys Tyr Glu Met
260 265 270

Thr Ser Asp Thr Trp Leu Ser Gln Gln Asn Glu Glu Pro Val Phe Ser
275 280 285

Arg Asp Gly Ser Lys Phe Phe Met Thr Val Pro Val Lys Gln Gly Gly
290 295 300

Arg Gly Glu Phe His His Ile Ala Met Phe Leu Ile Gln Ser Lys Ser
305 310 315 320

Glu Gln Ile Thr Val Arg His Leu Thr Ser Gly Asn Trp Glu Val Ile
325 330 335

Lys Ile Leu Ala Tyr Asp Glu Thr Thr Gln Lys Ile Ser Ala Ser Thr
340 345 350

Glu Gly Leu Leu Asn Arg Gln Cys Ile Ser Cys Asn Phe Met Lys Glu
355 360 365

Gln Cys Thr Tyr Phe Asp Ala Ser Phe Ser Pro Met Asn Gln His Phe
370 375 380

Leu Leu Phe Cys Glu Gly Pro Arg Val Pro Val Val Ser Leu His Ser
385 390 395 400

Thr Asp Asn Pro Ala Lys Tyr Phe Ile Leu Glu Ser Asn Ser Met Leu
405 410 415

Lys Glu Ala Ile Leu Lys Lys Lys Ile Gly Lys Pro Glu Ile Lys Ile
420 425 430

Leu His Ile Asp Asp Tyr Glu Leu Pro Leu Gln Leu Ser Leu Pro Lys
435 440 445

Asp Phe Met Asp Arg Asn Gln Tyr Ala Leu Leu Leu Ile Met Asp Glu
450 455 460

Glu Pro Gly Gly Gln Leu Val Thr Asp Lys Phe His Ile Asp Trp Asp
465 470 475 480

Ser Val Leu Ile Asp Met Asp Asn Val Ile Val Ala Arg Phe Asp Gly
485 490 495

Arg Gly Ser Gly Phe Gln Gly Leu Lys Ile Leu Gln Glu Ile His Arg
500 505 510

Arg Leu Gly Ser Val Glu Val Lys Asp Gln Ile Thr Ala Val Lys Phe
515 520 525

Leu Leu Lys Leu Pro Tyr Ile Asp Ser Lys Arg Leu Ser Ile Phe Gly
530 535 540

Lys Gly Tyr Gly Gly Tyr Ile Ala Ser Met Ile Leu Lys Ser Asp Glu
545 550 555 560

Lys Leu Phe Lys Cys Gly Ser Val Val Ala Pro Ile Thr Asp Leu Lys
565 570 575

Leu Tyr Ala Ser Ala Phe Ser Glu Arg Tyr Leu Gly Met Pro Ser Lys
580 585 590

Glu Glu Ser Thr Tyr Gln Ala Ala Ser Val Leu His Asn Val His Gly
595 600 605

Leu Lys Glu Glu Asn Ile Leu Ile Ile His Gly Thr Ala Asp Thr Lys
610 615 620

Val His Phe Gln His Ser Ala Glu Leu Ile Lys His Leu Ile Lys Ala
625 630 635 640

Gly Val Asn Tyr Thr Met Gln Val Tyr Pro Asp Glu Gly His Asn Val
645 650 655

Ser Glu Lys Ser Lys Tyr His Leu Tyr Ser Thr Ile Leu Lys Phe Phe
660 665 670

Ser Asp Cys Leu Lys Glu Glu Ile Ser Val Leu Pro Gln Glu Pro Glu
675 680 685

Glu Asp Glu
690



<210> 44 

<211> 4496 

<212> DNA 

<213> Homo sapiens 

<400> 44 



gkctykgtkg wtsmagatac agatgtggtg tataaaagcg agaatggaca tgtcattaaa 60

ctgaatatag aaacaaatgc taccacatta ttattggaaa acacaacttt tgtaaccttc 120

aaagcatcaa gacattcagt ttcaccagat ttaaaatatg tccttctggc atatgatgtc 180

aaacagattt ttcattattc gtatactgct tcatatgtga tttacaacat acacactagg 240

gaagtttggg agttaaatcc tccagaagta gaggactccg tcttgcagta cgcggcctgg 300

ggtgtccaag ggcagcagct gatttatatt tttgaaaata atatctacta tcaacctgat 360

ataaagagca gttcattgcg actgacatct tctggaaaag aagaaataat ttttaatggg 420

attgctgact ggttatatga agaggaactc ctgcattctc acatcgccca ctggtggtca 480

ccagatggag aaagacttgc cttcctgatg ataaatgact ctttggtacc caccatggtt 540

atccctcggt ttactggagc gttgtatccc aaaggaaagc agtatccgta tcctaaggca 600

ggtcaagtga acccaacaat aaaattatat gttgtaaacc tgtatggacc aactcacact 660

ttggagctca tgccacctga cagctttaaa tcaagagaat actatatcac tatggttaaa 720

tgggtaagca ataccaagac tgtggtaaga tggttaaacc gacctcagaa catctccatc 780

ctcacagtct gtgagaccac tacaggtgct tgtagtaaaa aatatgagat gacatcagat 840

acgtggctct ctcagcagaa tgaggagccc gtgttttcta gagacggcag caaattcttt 900

atgacagtgc ctgttaagca agggggacgt ggagaatttc accacatagc tatgttcctc 960

atccagagta aaagtgagca aattaccgtg cggcatctga catcaggaaa ctgggaagtg 1020

ataaagatct tggcatacga tgaaactact caaaaaatca gtgcttctac tgaaggatta 1080

ttgaatcgcc aatgcatttc atgtaatttc atgaaagaac aatgtacata ttttgatgcc 1140

agttttagtc ccatgaatca acatttctta ttattctgtg aaggtccaag ggtcccagtg 1200

gtcagcctac atagtacgga caacccagca aaatatttta tattggaaag caattctatg 1260

ctgaaggaag ctatcctgaa gaagaagata ggaaagccag aaattaaaat ccttcatatt 1320

gacgactatg aacttccttt acagttgtcc cttcccaaag attttatgga ccgaaaccag 1380

tatgctcttc tgttaataat ggatgaagaa ccaggaggcc agctggttac agataagttc 1440

catattgact gggattccgt actcattgac atggataatg tcattgtagc aagatttgat 1500

ggcagaggaa gtggattcca gggtctgaaa attttgcagg agattcatcg aagattaggt 1560

tcagtagaag taaaggacca aataacagct gtgaaatttt tgctgaaact gccttacatt 1620

gactccaaaa gattaagcat ttttggaaag ggttatggtg gctatattgc atcaatgatc 1680

ttaaaatcag atgaaaagct ttttaaatgt ggatccgtgg ttgcacctat cacagacttg 1740

aaattgtatg cctcagcttt ctctgaaaga taccttggga tgccatctaa ggaagaaagc 1800

acttaccagg cagccagtgt gctacataat gttcatggct tgaaagaaga aaatatatta 1860

ataattcatg gaactgctga cacaaaagtt catttccaac actcagcaga attaatcaag 1920

cacctaataa aagctggagt gaattatact atgcaggtct acccagatga aggtcataac 1980

gtatctgaga agagcaagta tcatctctac agcacaatcc tcaaattctt cagtgattgt 2040

ttgaaggaag aaatatctgt gctaccacag gaaccagaag aagatgaata atggaccgta 2100

tttatacaga actgaaggga atattgaggc tcaatgaaac ctgacaaaga gactgtaata 2160

ttgtagttgc tccagaatgt caagggcagc ttacggagat gtcactggag cagcacgctc 2220

agagacagtg aactagcatt tgaatacaca agtccaagtc tactgtgttg ctaggggtgc 2280

agaacccgtt tctttgtatg agagaggtca aagggttggt ttcctgggag aaattagttt 2340

tgcattaaag taggagtagt gcatgttttc ttctgttatc cccctgtttg ttctgtaact 2400

agttgctctc attttaattt cactggccac catcatcttt gcatataatg cacaatctat 2460

catctgtcct acagtccctg atctttcatg gctgagctgc aatctaacac tttactgtac 2520

ctttataata agtgcaattc tttcattgtc tattattatg cttaagaaaa tattcagtta 2580

ataaaaaaca gagtatttta tgtaatttct gtttttaaaa agacattatt aaatgggtca 2640

aaggacatat agaaatgtgg atttcagcac cttccaaagt tcagccagtt atcagtagat 2700

acaatatctt taaatgaaca cacgagtgta tgtctcacaa tatatataca caagtgtgca 2760

tatacagtta atgaaactat ctttaaatgt tattcatgct ataaagagta aacgtttgat 2820

gaattagaag agatgctctt ttccaagcta taatggatgc tttgtttaat gagccaaata 2880

tgatgaaaca ttttttccaa ttcaaattct agctattgct ttcctataaa tgtttgggtt 2940

gtgtttggta ttgtttttag tggttaatag ttttccagtt gcatttaatt ttttgaatat 3000

gataccttgt cacatgtaaa ttagatactt aaatattaaa ttatagtttc tgataaagaa 3060

attttgttaa caatgcaatg ccactgagtg ctattttgct cttttggtgg agaaggcttt 3120

tttcaaaact cttggtcctt ttacttcttt ctctcagtgc agaatcaatt ctcattttca 3180

tcgtaaaagc aaatagctgg attatttcat ttgccagttt ctatttagta ttccatgcct 3240

gcccaattca tctgttactg tttaatttca attcttctgg tgagaattag aaatgaaata 3300

ttttttattc attggccaaa aagttcacag acagcagtgt ttgctattta ctttgaatcg 3360

aaggcacaaa atgcatcaat tcctgtgctg tgttgacttg cagtagtaag taactgagag 3420

cataaaataa acctgactgt atgaagtcaa tttaagtgat gagaacattt aactttggtg 3480

actaaagtca gaatatcttc tcacttcact taagggatct tccagaagat atctaaaagt 3540

ctgtaataag cttagaagtt cagataaatc taggcaggat actgcatttt tgtggtttta 3600

aaaaagtcct taggacagac tgaattatca taacttatgg catcaggagg aaactttaaa 3660

atatcaagga atcactcagt caccctcctg ttttgttgaa ggatcaaccc caaattctgg 3720

gtatttgagt acatgtgaat catggatttg gtattcaact ttttccctgg atgctttgga 3780

atcgtgtctt ccatgctcca ctgggttcaa tttaaaatag gagaggcttt ctcttctgaa 3840

agatccattt taggtctttt tcaagaatag tgaacacatt ttttaacaaa ataagttgta 3900

attttaaaag gaaagttttg cctattttat taagatggaa atttcttttt aggctaattt 3960

gaaatccaac tgaagctttt taaccaatat tttaaatttg aaccactaga gttttttatg 4020

atgcaaatga ttatgttgtc tgaaaggtgt ggttttattg aatgtctatt tgagtatcat 4080

ttaaaaagta tttgcctttt actgtcatca tttctcttgt tttattatta ttatcaatgt 4140

ttatctattt ttcaattaat ttaatacagt ttctaatgtg aaagacattt ttctggaacc 4200

cgttttcccc ttaaacacta aagagacctc aagtgaaagc atattgctta gtaggaaggt 4260

agaaaatgtt aatccctgcg attctttgag ttttaatgac agggtcattt tcagtaaagg 4320

aaatgctcac caacacatag tcaccaacta ttaaaggaat catgtgattg gattttcccc 4380

tgtatacatg tacccttggt cataatccca ctatttcata catatttatg cattgctaga 4440

ttttcctagg actccaatag catgctttcc aagtgttatt attcccttaa tgttaa 4496

<210> 45

<211> 29

<212> DNA

<213> Homo sapiens

<400> 45

cggtaccatg gcagcagcaa tggaaacag

<210> 46

<211> 39

<212> DNA

<213> Homo sapiens

<400> 46

ggagctcgcg gccgctcata tcacttttag agcagcaat

<210> 47

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 47 

caagctttat cacttttaga gcagcaa 

<210> 48 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 48 

cacattcttg ctgcatcagt ca 

<210> 49 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 49 

ttgggtcatc ttcaggactt ga 

<210> 50 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 50 

caagcttacc atggccacca ccgggac 

<210> 51 

<211> 37 

<212> DNA 

<213> Homo sapiens 

<400> 51 

cggatccgcg gccgctcaga ggtattcctg tagaaag 

<210> 52 

<211> 27 

<212> DNA 

<213> Homo sapiens 

<400> 52 

cggatccagg tattcctgta gaaagtg 

<210> 53 

<211> 20 

<212> DNA 

<213> Homo sapiens 

<400> 53 

tacgccgtgg ttgtgattga 

<210> 54 
<211> 20 
<212> DNA 
<213> Homo sapiens
<400> 54

ccatacttct cggccacgaa 20

<210> 55 

<211> 19 

<212> DNA 

<213> Homo sapiens 

<400> 55 

gcctgggatt gtgcactgt 

<210> 56 

<211> 29 

<212> DNA 

<213> Homo sapiens 

<400> 56 

gtgtattcaa atgctagttc actgtctct 29 

<210> 57 

<211> 22 

<212> DNA 

<213> Homo sapiens 

<400> 57 

agctagcact gtccagggtc ct 22 

<210> 58 

<211> 25 

<212> DNA 

<213> Homo sapiens 

<400> 58 

agggcccttc atcttcttct ggttc 25 

<210> 59 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 59 

Val Glu Asp Asp Val Met Glu Arg Gln Arg Leu Ile Glu Ser Val Pro
1 5 10 15 

Asp Ser Val 

<210> 60 

<211> 19 

<212> PRT 

<213> Homo sapiens 

<400> 60 

Ser Thr Glu Asn Glu Glu Gln Arg Leu Ala Ser Ala Arg Ala Val Pro
1 5 10 15 

Arg Asn Val 

<210> 61 
<211> 15 
<212> PRT

<213> Homo sapiens 

<400> 61 

Lys Glu Ala Ile Leu Lys Lys Lys Ile Gly Lys Pro Glu Ile Lys
1 5 10 15 



